Skip to content

    xai-org/grok-1

    Repository files navigation

    Grok-1

    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

    Make sure to download the checkpoint and place the ckpt-0 directory in checkpoints - see Downloading the weights

    Then, run

    pip install -r requirements.txt
    python run.py

    to test the code.

    The script loads the checkpoint and samples from the model on a test input.

    Due to the large size of the model (314B parameters), a machine with enough GPU memory is required to test the model with the example code. The implementation of the MoE layer in this repository is not efficient. The implementation was chosen to avoid the need for custom kernels to validate the correctness of the model.

    Model Specifications

    Grok-1 is currently designed with the following specifications:

    • Parameters: 314B
    • Architecture: Mixture of 8 Experts (MoE)
    • Experts Utilization: 2 experts used per token
    • Layers: 64
    • Attention Heads: 48 for queries, 8 for keys/values
    • Embedding Size: 6,144
    • Tokenization: SentencePiece tokenizer with 131,072 tokens
    • Additional Features:
      • Rotary embeddings (RoPE)
      • Supports activation sharding and 8-bit quantization
    • Maximum Sequence Length (context): 8,192 tokens

    Downloading the weights

    You can download the weights using a torrent client and this magnet link:

    magnet:?xt=urn:btih:5f96d43576e3d386c9ba65b883210a393b68210e&tr=https%3A%2F%2Facademictorrents.com%2Fannounce.php&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce
    

    or directly using HuggingFace ?? Hub:

    git clone https://github.com/xai-org/grok-1.git && cd grok-1
    pip install huggingface_hub[hf_transfer]
    huggingface-cli download xai-org/grok-1 --repo-type model --include ckpt-0/* --local-dir checkpoints --local-dir-use-symlinks False
    

    License

    The code and associated Grok-1 weights in this release are licensed under the Apache 2.0 license. The license only applies to the source files in this repository and the model weights of Grok-1.

    About

    Grok open release

    Resources

    License

    Code of conduct

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published

    Contributors 7

    Languages

    主站蜘蛛池模板: 91福利一区二区| 国产午夜三级一区二区三| 精品国产一区二区三区免费| 亚洲中文字幕乱码一区| 国偷自产av一区二区三区| 国产免费无码一区二区| 国产成人一区二区三区电影网站| 91video国产一区| 亚洲AV无码一区二区乱子伦 | 日本无码一区二区三区白峰美| 精品国产一区二区22 | 无码av人妻一区二区三区四区| 日本一区二区三区在线观看 | 一区二区三区无码高清视频| 日本人的色道www免费一区 | 亚洲熟妇av一区二区三区漫画| 日本一区二区免费看| 国产精品一区二区久久精品无码| 免费无码AV一区二区| 国产激情无码一区二区三区| 国产乱码精品一区二区三区中文| 精品国产AV一区二区三区| 男人免费视频一区二区在线观看 | 在线电影一区二区| 日本人真淫视频一区二区三区| 国产午夜精品一区二区三区小说| 高清国产AV一区二区三区| 国产对白精品刺激一区二区| 日本精品高清一区二区2021| 91国在线啪精品一区| 精品亚洲AV无码一区二区三区| 国产一区二区三区内射高清| 日韩精品一区二区三区四区| 免费一区二区三区四区五区| 日韩精品乱码AV一区二区| 久久无码人妻精品一区二区三区| 波多野结衣精品一区二区三区| 中文乱码人妻系列一区二区| 国产成人精品日本亚洲专一区 | 国产色欲AV一区二区三区| 99国产精品欧美一区二区三区|