github

abacaj / awesome-transformers

  • вторник, 28 февраля 2023 г. в 00:13:44
https://github.com/abacaj/awesome-transformers


A curated list of awesome transformer models.



Awesome Transformers

Transformers

A curated list of awesome transformer models.

If you want to contribute to this list, send a pull request or reach out to me on twitter: @abacaj. Let's make this list useful.

There are a number of models available that are not entirely open source (non-commercial, etc), this repository should serve to also make you aware of that. Tracking the original source/company of the model will help.

I would also eventually like to add model use cases. So it is easier for others to find the right one to fine-tune.

Format:

  • Model name: short description, usually from paper
    • Model link (usually huggingface or github)
    • Paper link
    • Source as company or group
    • Model license

Table of Contents

Encoder models

  • ALBERT: "A Lite" version of BERT
  • BERT: Bidirectional Encoder Representations from Transformers
  • DistilBERT: Distilled version of BERT smaller, faster, cheaper and lighter
  • DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
  • Electra: Pre-training Text Encoders as Discriminators Rather Than Generators
  • RoBERTa: Robustly Optimized BERT Pretraining Approach

Decoder models

Encoder+decoder (seq2seq) models

Multimodal models

Vision models

Audio models

Recommendation models