Organizations

@nushackers @basement-gang

Pinned

  1. neural-turing-machines (Public)

    Attempt at implementing the system described in "Neural Turing Machines" by Alex Graves, Greg Wayne, and Ivo Danihelka (http://arxiv.org/abs/1410.5401).

    Jupyter Notebook · 461 stars · 96 forks

  2. scattermoe (Public)

    Triton-based implementation of Sparse Mixture of Experts.

    Python · 197 stars · 16 forks

  3. SUT (Public)

    Repository for Sparse Universal Transformers

    Python · 17 stars · 1 fork

  4. stickbreaking-attention (Public)

    Stick-breaking attention

    Python · 43 stars · 1 fork

  5. IBM/dolomite-engine (Public)

    Dolomite Engine is a library for pretraining and finetuning LLMs.

    Python · 36 stars · 11 forks