# differential-transformer

Here are 2 public repositories matching this topic...

nanowell / Differential-Transformer-PyTorch

A PyTorch implementation of the Differential Transformer architecture for sequence modeling, built as a decoder-only model in the style of large language models (LLMs). The architecture combines a Differential Attention mechanism with a multi-head structure, RMSNorm, and SwiGLU.

machine-learning pytorch large-language-models differential-transformer

Updated Oct 27, 2024
Python
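
For readers unfamiliar with the Differential Attention mechanism named in the description above, here is a minimal single-head sketch of the core idea: two causal softmax attention maps are computed and the second is subtracted from the first, scaled by a factor lambda, before weighting the values. This is not the repository's code. It uses plain Python lists instead of PyTorch so it runs standalone, the function names are hypothetical, and `lam` is treated as a fixed scalar (an assumption; in the Differential Transformer it is a learned quantity). Multi-head projection, RMSNorm, and SwiGLU are omitted.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def causal_attention_weights(q, k):
    """Scaled dot-product attention weights with a causal mask.

    q, k: lists of equal-length vectors, one per position.
    Position i may attend only to positions 0..i; later positions get weight 0.
    """
    d = len(q[0])
    weights = []
    for i, qi in enumerate(q):
        scores = [
            sum(a * b for a, b in zip(qi, k[j])) / math.sqrt(d)
            for j in range(i + 1)
        ]
        probs = softmax(scores)
        weights.append(probs + [0.0] * (len(k) - i - 1))
    return weights


def differential_attention(q1, k1, q2, k2, v, lam=0.5):
    """Single-head differential attention (illustrative sketch).

    Computes (softmax(q1 k1^T / sqrt(d)) - lam * softmax(q2 k2^T / sqrt(d))) v.
    lam is a fixed scalar here, which is an assumption made for simplicity.
    """
    a1 = causal_attention_weights(q1, k1)
    a2 = causal_attention_weights(q2, k2)
    out = []
    for r1, r2 in zip(a1, a2):
        diff = [x - lam * y for x, y in zip(r1, r2)]
        out.append(
            [sum(w * vj[c] for w, vj in zip(diff, v)) for c in range(len(v[0]))]
        )
    return out
```

With `lam=0.0` the function reduces to ordinary causal softmax attention, which makes it easy to compare against a standard implementation.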

FENRlR / DTF-VITS

An experimental variation of VITS with Microsoft's Differential Transformer method applied to its text encoder.

tts vits differential-transformer

Updated Nov 20, 2024
Python
