Skip to content

Tuchis/smamba

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

S(m)AMBA

Authors

Description

Custom implementation of Mamba state space model pretrained for NLP tasks.

S(m)AMBA reflects our vision of how S4 and Mamba should look like. We trained this model on four different datasets:

  1. Shakespeare - collection of Shakespeare's writings.
  2. Harry Tinny - "Boy Who Lived" chapter of the first book in the famous series.
  3. Harry 1 - entire first book.
  4. Harry Full - first 4 books in the series.

Demo

What we have achieved with this network: demo_gif

Usage

Training

python train.py --config configs/<model>_config.yaml

For config file you can chose one of already available configs (in the configs directory), or create or own.

Inference

For inference you can also choose from predifined configs with pretrained models.

python inference.py \
    --config configs/<model>_config.yaml \
    --checkpoint weights/<model>.pth \
    --string "Example prompt." \
    --length 256

License

Unlicensed

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages