dynatrace-oss/llama-embed-mamba2-7b
Sentence Similarity • 7B
Text embedding models based on Mamba2 with linear-time and constant-memory inference through vertical chunking.
Note: Adjusted Mamba2 model code and kernels to support chunked inference with recurrent state passing.
Note: Chunkable transformer and last-index pooling modules for Sentence Transformers, enabling constant-memory encoding.
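The idea behind chunked inference with recurrent state passing and last-index pooling can be illustrated with a toy example. The sketch below is not the repository's code or API: it replaces Mamba2 with a simple per-dimension linear recurrence, and the names `scan`, `encode_chunked`, `encode_full`, `decay`, and `gain` are hypothetical. It only shows why carrying the state from one chunk to the next gives the same embedding as a single pass, while holding at most one chunk of outputs in memory.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def scan(
    xs: Sequence[Vector], state: Vector, decay: float = 0.9, gain: float = 0.1
) -> Tuple[List[Vector], Vector]:
    """Toy stand-in for a recurrent layer: h_t = decay * h_{t-1} + gain * x_t.

    Returns the per-position outputs and the final state to hand to the next chunk.
    """
    outputs: List[Vector] = []
    for x in xs:
        state = [decay * h + gain * xi for h, xi in zip(state, x)]
        outputs.append(state)
    return outputs, state


def encode_full(xs: Sequence[Vector], dim: int) -> Vector:
    """Process the whole (non-empty) sequence at once and pool the last index."""
    outputs, _ = scan(xs, [0.0] * dim)
    return outputs[-1]


def encode_chunked(xs: Sequence[Vector], chunk_size: int, dim: int) -> Vector:
    """Process fixed-size chunks, passing the recurrent state between them.

    Only one chunk of outputs exists at a time, so memory does not grow
    with the sequence length.
    """
    state = [0.0] * dim
    pooled = state
    for start in range(0, len(xs), chunk_size):
        outputs, state = scan(xs[start:start + chunk_size], state)
        pooled = outputs[-1]  # last-index pooling: keep only the final position
    return pooled


# Hypothetical input: 7 token vectors of dimension 2.
tokens = [[float(i), float(-i)] for i in range(1, 8)]
full_embedding = encode_full(tokens, dim=2)
chunked_embedding = encode_chunked(tokens, chunk_size=3, dim=2)
```

In this toy, both paths apply the same operations in the same order, so the two embeddings are identical. A real model also has to carry any per-layer state (for example convolution buffers) across chunk boundaries, which this sketch does not model.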