LonePatient Electra Pytorch Save

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Project README

electra_pytorch

This repository contains a PyTorch implementation of the electra model from the paper

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

by Kevin Clark. Minh-Thang Luong. Quoc V. Le. Christopher D. Manning

NOTE： 🤗This version is experience version,and the offical PyTorch version is waiting for the update of 🤗huggingface

NOTE: 2020-04-08 ELECTRA is now available in PyTorch through the HuggingFace Transformers library! https://github.com/huggingface/transformers/releases/tag/v2.8.0

Dependencies

pytorch=1.10+
cuda=9.0
cudnn=7.5
scikit-learn
sentencepiece
python3.6+

Download Pre-trained Models

English: Official download links: google electra

Chinese:

Fine-tuning

１. Place config.json into the prev_trained_model/electra_base directory. example:

├── prev_trained_model
|  └── electra_base
|  |  └── pytorch_model.bin
|  |  └── config.json
|  |  └── vocab.txt

2．convert electra tf checkpoint to pytorch

python convert_electra_tf_checkpoint_to_pytorch.py \
    --tf_checkpoint_path=./prev_trained_model/electra_large \
    --electra_config_file=./prev_trained_model/electra_large/config.json \
    --pytorch_dump_path=./prev_trained_model/electra_large/pytorch_model.bin

Before running anyone of these GLUE/CLUE tasks you should download the GLUE data /CLUE data by running script named download_xxxx_data in the directorytools and unpack it to some directory $DATA_DIR.

3．run sh scripts/run_classifier_sst2.shto fine tuning albert model

Result

Performance of electra on GLUE benchmark results using a single-model setup on dev:

	Cola	Sst-2	Mnli	Sts-b
metrics	matthews_corrcoef	accuracy	accuracy	pearson
electra_small	56.6	90.5		87.6
electra_base	67.8	94.2		91.1
electra_large	71.1	95.8		92.4

Performance of electra on CLUE benchmark results using a single-model setup on dev:

	AFQMC	TNEWS	IFLYTEK
metrics	accuracy	accuracy	accuracy
electra_tiny	69.82	54.48	56.98

Performance of electra on chnsenticorp results using a single-model setup on dev:

	chnsenticorp
metrics	accuracy
electra_small	92.75
electra_base	94.08

pretraining

test on small dataset.

Open Source Agenda is not affiliated with "LonePatient Electra Pytorch" Project. README Source: lonePatient/electra_pytorch

Stars

Open Issues

Last Commit

2 years ago

Repository

lonePatient/electra_pytorch

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/lonepatient-electra-pytorch"><img src="https://www.opensourceagenda.com/projects/lonepatient-electra-pytorch/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022