Microsoft DeBERTa surpasses puny humans in SuperGlue reading comprehension test
There has been massive progress recently in training networks with millions of parameters. Microsoft recently updated the DeBERTa (Decoding-enhanced BERT with disentangled attention) model by training a…
