Custom implementation of the Mamba state space model, pretrained for NLP tasks.
S(m)AMBA reflects our vision of what S4 and Mamba should look like. We trained this model on four different datasets:
- Shakespeare - a collection of Shakespeare's writings.
- Harry Tinny - the "Boy Who Lived" chapter of the first book in the famous series.
- Harry 1 - the entire first book.
- Harry Full - the first four books in the series.
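Under the hood, both S4 and Mamba run a discretized linear state space recurrence over the sequence. A minimal sketch in plain Python (not this repo's code: scalars stand in for the learned matrices, and the input-dependent "selective" B and C of Mamba are omitted):

```python
# Toy state space recurrence:
#   h_t = A * h_{t-1} + B * x_t   (state update)
#   y_t = C * h_t                 (readout)
def ssm_scan(xs, A=0.9, B=1.0, C=0.5):
    """Run the linear recurrence over an input sequence xs."""
    h = 0.0
    ys = []
    for x in xs:
        h = A * h + B * x   # carry compressed history forward
        ys.append(C * h)    # project hidden state to output
    return ys

# An impulse input decays geometrically through the state:
print(ssm_scan([1.0, 0.0, 0.0]))
```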
To train a model, run:

```shell
python train.py --config configs/<model>_config.yaml
```
For the config file you can choose one of the configs already available in the `configs` directory, or create your own.
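A custom config might look like the following sketch (the field names here are illustrative assumptions, not the repo's actual schema; check an existing file in `configs` for the real keys):

```yaml
# Hypothetical training config sketch
model:
  d_model: 256        # embedding / hidden width
  n_layers: 4         # number of Mamba blocks
data:
  dataset: shakespeare
training:
  batch_size: 32
  lr: 3.0e-4
  epochs: 10
```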
For inference you can also choose from the predefined configs with pretrained models:
```shell
python inference.py \
    --config configs/<model>_config.yaml \
    --checkpoint weights/<model>.pth \
    --string "Example prompt." \
    --length 256
```
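Conceptually, `--string` seeds the model and `--length` is how many further characters are sampled autoregressively. A toy sketch of that loop (hypothetical, not the repo's inference code; a fixed bigram table stands in for the trained network):

```python
# Greedy autoregressive generation with a stand-in "model".
def generate(prompt, length, table):
    out = list(prompt)
    for _ in range(length):
        # pick the next character from the last one; fall back to a space
        out.append(table.get(out[-1], " "))
    return "".join(out)

bigram = {"a": "b", "b": "a"}
print(generate("a", 4, bigram))  # → "ababa"
```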
Unlicensed