Skip to content

    xai-org/grok-1

    Repository files navigation

    Grok-1

    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

    Make sure to download the checkpoint and place the ckpt-0 directory in checkpoints - see Downloading the weights

    Then, run

    pip install -r requirements.txt
    python run.py

    to test the code.

    The script loads the checkpoint and samples from the model on a test input.

    Due to the large size of the model (314B parameters), a machine with enough GPU memory is required to test the model with the example code. The implementation of the MoE layer in this repository is not efficient. The implementation was chosen to avoid the need for custom kernels to validate the correctness of the model.

    Model Specifications

    Grok-1 is currently designed with the following specifications:

    • Parameters: 314B
    • Architecture: Mixture of 8 Experts (MoE)
    • Experts Utilization: 2 experts used per token
    • Layers: 64
    • Attention Heads: 48 for queries, 8 for keys/values
    • Embedding Size: 6,144
    • Tokenization: SentencePiece tokenizer with 131,072 tokens
    • Additional Features:
      • Rotary embeddings (RoPE)
      • Supports activation sharding and 8-bit quantization
    • Maximum Sequence Length (context): 8,192 tokens

    Downloading the weights

    You can download the weights using a torrent client and this magnet link:

    magnet:?xt=urn:btih:5f96d43576e3d386c9ba65b883210a393b68210e&tr=https%3A%2F%2Facademictorrents.com%2Fannounce.php&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce
    

    or directly using HuggingFace ?? Hub:

    git clone https://github.com/xai-org/grok-1.git && cd grok-1
    pip install huggingface_hub[hf_transfer]
    huggingface-cli download xai-org/grok-1 --repo-type model --include ckpt-0/* --local-dir checkpoints --local-dir-use-symlinks False
    

    License

    The code and associated Grok-1 weights in this release are licensed under the Apache 2.0 license. The license only applies to the source files in this repository and the model weights of Grok-1.

    About

    Grok open release

    Resources

    License

    Code of conduct

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published

    Languages

    主站蜘蛛池模板: 亚洲国产精品一区| 伊人无码精品久久一区二区| 国产伦精品一区二区免费 | 国产美女av在线一区| 一区在线免费观看| 国产一区中文字幕在线观看| 蜜桃视频一区二区| 波多野结衣一区二区三区aV高清| 亚洲综合无码精品一区二区三区| 国精产品一区二区三区糖心| 日韩免费无码一区二区三区| 久久久无码精品人妻一区| 精品不卡一区中文字幕| 亚洲欧洲∨国产一区二区三区| 天堂国产一区二区三区| 亚洲一区二区三区丝袜| 天码av无码一区二区三区四区| 免费一区二区无码东京热| 射精专区一区二区朝鲜| 国产精品视频免费一区二区| av无码精品一区二区三区四区| 国产精品日本一区二区不卡视频| 成人免费观看一区二区| 国产vr一区二区在线观看| 少妇特黄A一区二区三区| 欧洲精品免费一区二区三区| 中文字幕人妻丝袜乱一区三区| 国产短视频精品一区二区三区| 国产在线一区二区视频| 无码一区二区三区免费| 国产一区二区三区在线观看影院| 一区二区乱子伦在线播放| 韩国一区二区三区视频| 久久精品一区二区三区日韩| 卡通动漫中文字幕第一区| 精品视频无码一区二区三区| 无遮挡免费一区二区三区| 五月婷婷一区二区| 国产乱码一区二区三区爽爽爽| 国产精品香蕉一区二区三区| 国产在线视频一区二区三区98 |