CLUEbenchmark ELECTRA Save

中文预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model

Project README

ELECTRA

中文预训练 ELECTREA 模型: 基于对抗学习 pretrain Chinese Model

code Repost from google official code: https://github.com/google-research/electra

具体使用说明：参考官方链接

Electra Chinese tiny模型路径

google drive

electra-tiny google-drive

baidu drive

electra-tiny baidu-pan code:rs99

模型说明

与 tinyBERT 的配置相同
generator 为 discriminator的 1/4

How to use official code

Steps

修改 configure_pretraining.py 里面的数据路径、tpu、gpu 配置
修改 model_size：可在 code/util/training_utils.py 里面自行定义模型大小
数据输入格式：原始的input_ids, input_mask, segment_ids，训练过程中会在线做 uniform mask sampling（不需要离线生成 masked input ids）

Performance

gen+disc:

electra-tiny

metric	value
disc_accuracy	0.95093095
disc_auc	0.9762006
disc_loss	0.14071295
disc_precision	0.8018275
disc_recall	0.6088053
loss	9.516352
masked_lm_accuracy	0.46732807
masked_lm_loss	2.8209455
sampled_masked_lm_accuracy	0.3504382

The model are trained on CLUE 10G Chinese Corpus with 1M-steps

Downstream finetuning on CLUE benchmark:

注：only use pretrained electra-tiny with layer-wise learning rate decay without any distilaltion、data-augmentation. learning rate is set to 1e-4 for each task and run 10-epochs. (According to official results, the results may have large variance)

	AFQMC	TNEWS	IFLYTEK	CMNLI	WSC	CSL
Metrics	Acc	Acc	Acc	Acc	Acc	Acc
ELECTRA-tiny	70.319	54.280	53.538	73.745	64.336	78.700
Roberta-tiny	69.904	54.150	56.808	74.037	64.336	74.133

注：

electra 在多分类问题上面可能会有 performance 下降
gen、disc的规模配比比较hacky，与 mask的方法等相关

报名NLPCC-高性能小模型测评

Open Source Agenda is not affiliated with "CLUEbenchmark ELECTRA" Project. README Source: CLUEbenchmark/ELECTRA

Stars

141

Open Issues

Last Commit

4 years ago

Repository

CLUEbenchmark/ELECTRA

Homepage

https://openreview.net/forum?id=r1xMH1BtvB

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/cluebenchmark-electra"><img src="https://www.opensourceagenda.com/projects/cluebenchmark-electra/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022