Similar repositories to pmichel31415/are-16-heads-really-better-than-1:
pmichel31415/are-16-heads-really-better-than-1
github
similar
hanxiao/bert-as-service
github
similar
danistefanovic/build-your-own-x
github
similar
pandas-profiling/pandas-profiling
github
similar
kon9chunkit/GitHub-Chinese-Top-Charts
github
similar
tuvtran/project-based-learning
github
similar
yandex/YaLM-100B
github
similar
srush/GPU-Puzzles
github
similar
cgnorthcutt/cleanlab
github
similar
sannykim/transformers
github
similar
lab-ml/annotated_deep_learning_paper_implementations
github
similar
lena-voita/the-story-of-heads
github
similar
lab-ml/labml
github
similar
castorini/DeeBERT
github
similar
yzh119/BPT
github
similar
clovaai/length-adaptive-transformer
github
similar
intersun/CoDIR
github
similar
neulab/RIPPLe
github
similar
laiguokun/Funnel-Transformer
github
similar
JetRunner/BERT-of-Theseus
github
similar
intersun/PKD-for-BERT-Model-Compression
github
similar
harvardnlp/cascaded-generation
github
similar
clarkkev/attention-analysis
github
similar
mandarjoshi90/pair2vec
github
similar
artetxem/uncovec
github
similar
facebookresearch/adaptive-span
github
similar
ofirpress/shortformer
github
similar
nelson-liu/contextual-repr-analysis
github
similar
VITA-Group/BERT-Tickets
github
similar
IBM/PoWER-BERT
github
similar
MC-BERT/MC-BERT
github
similar
qijiezhao/M2Det
github
similar
NVIDIA/TRTorch
github
similar
xplip/pixel
github
similar
uwnlp/denspi
github
similar
harvardnlp/urnng
github
similar
alexa/alexa-dataset-contextual-query-rewrite
github
similar
bckim92/sequential-knowledge-transformer
github
similar
mrqa/MRQA-Shared-Task-2019
github
similar
qkaren/converse_reading_cmr
github
similar