https://github.com/kimrass/bert
'BERT' (Devlin et al., 2019) implementation from scratch in PyTorch
- Host: GitHub
- URL: https://github.com/kimrass/bert
- Owner: KimRass
- Created: 2023-08-26T14:46:43.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-10T13:57:16.000Z (over 1 year ago)
- Last Synced: 2024-02-11T05:27:50.556Z (over 1 year ago)
- Topics: bert, bookcorpus
- Language: Python
- Homepage:
- Size: 2.55 MB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# BERT from scratch
- Please note that I was not able to complete pre-training or fine-tuning of the model due to my compute environment, but I verified that both pre-training and fine-tuning run correctly.

# Paper Reading
- [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](https://arxiv.org/pdf/1810.04805.pdf)

# Research
- Using `Adam(model.parameters(), lr=1e-4, betas=(0.9, 0.999), weight_decay=0.01)` as specified in the BERT paper, I found that pre-training did not progress: no matter how long I trained, the NSP loss never dropped below 0.69 (roughly ln 2, i.e., chance level for the binary NSP task). After removing the `weight_decay=0.01` term, training proceeded normally.
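
A minimal sketch of the two optimizer configurations described above (the `torch.nn.Linear` model is only a placeholder, not the repository's BERT model):

```python
import torch

# Placeholder model; in the repository this would be the BERT model being pre-trained.
model = torch.nn.Linear(8, 8)

# Optimizer as specified in the BERT paper; with this setup the NSP loss
# reportedly plateaued around 0.69 (chance level for the binary task).
optimizer_paper = torch.optim.Adam(
    model.parameters(), lr=1e-4, betas=(0.9, 0.999), weight_decay=0.01
)

# Removing the weight_decay term let pre-training converge normally.
optimizer_fixed = torch.optim.Adam(
    model.parameters(), lr=1e-4, betas=(0.9, 0.999)
)
```

One possible factor, not analyzed in the README, is that `torch.optim.Adam` applies `weight_decay` as plain L2 regularization, whereas the original BERT implementation used decoupled weight decay (as in `torch.optim.AdamW`).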