thomwolf
|
eebc8abbe2
|
clarify and unify model saving logic in examples
|
2019-02-11 14:04:19 +01:00 |
thomwolf
|
81c7e3ec9f
|
fix typo in readme
|
2019-02-11 13:37:12 +01:00 |
thomwolf
|
884ca81d87
|
transposing the inputs of Transformer-XL to have a unified interface
|
2019-02-11 13:19:59 +01:00 |
thomwolf
|
32fea876bb
|
add distant debugging to run_transfo_xl
|
2019-02-11 12:53:32 +01:00 |
thomwolf
|
b31ba23913
|
cuda on in the examples by default
|
2019-02-11 12:15:43 +01:00 |
thomwolf
|
2071a9b86e
|
fix python 2.7 imports
|
2019-02-11 10:35:36 +01:00 |
thomwolf
|
b514a60c36
|
added tests for OpenAI GPT and Transformer-XL tokenizers
|
2019-02-11 10:17:16 +01:00 |
thomwolf
|
9f9909ea2f
|
update readme
|
2019-02-09 16:59:21 +01:00 |
thomwolf
|
0c1a6f9b1d
|
update readme
|
2019-02-08 22:32:25 +01:00 |
thomwolf
|
009b581316
|
updated readme
|
2019-02-07 23:15:05 +01:00 |
thomwolf
|
f99f2fb661
|
docstrings
|
2019-02-07 17:07:22 +01:00 |
Thomas Wolf
|
848aae49e1
|
Merge branch 'master' into python_2
|
2019-02-06 00:13:20 +01:00 |
thomwolf
|
ba37ddc5ce
|
fix run_lm_modeling example command line
|
2019-02-06 00:07:08 +01:00 |
Girishkumar
|
0dd2b750ca
|
Minor update in README
Update links to classes in `modeling.py`
|
2019-01-30 23:49:15 +05:30 |
thomwolf
|
3a848111e6
|
update config, docstrings and readme to switch to separated tokens and position embeddings
|
2019-01-29 11:00:11 +01:00 |
Davide Fiocco
|
35115eaf93
|
(very) minor update to README
|
2019-01-16 21:05:24 +01:00 |
nhatchan
|
8edc898f63
|
Fix documentation (missing backslashes)
This PR adds missing backslashes in LM Fine-tuning subsection in README.md.
|
2019-01-13 21:23:19 +09:00 |
thomwolf
|
e5c78c6684
|
update readme and few typos
|
2019-01-10 01:40:00 +01:00 |
thomwolf
|
fa5222c296
|
update readme
|
2019-01-10 01:25:28 +01:00 |
Thomas Wolf
|
c18bdb4433
|
Merge pull request #124 from deepset-ai/master
Add example for fine tuning BERT language model
|
2019-01-07 12:03:51 +01:00 |
Julien Chaumond
|
8da280ebbe
|
Setup CI
|
2018-12-20 16:33:39 -05:00 |
tholor
|
e5fc98c542
|
add exemplary training data. update to nvidia apex. refactor 'item -> line in doc' mapping. add warning for unknown word.
|
2018-12-20 18:30:52 +01:00 |
tholor
|
67f4dd56a3
|
update readme for run_lm_finetuning
|
2018-12-19 09:22:37 +01:00 |
Julien Chaumond
|
d57763f582
|
Fix typos
|
2018-12-18 19:23:22 -05:00 |
Thomas Wolf
|
786cc41299
|
Typos in readme
|
2018-12-17 09:22:18 +01:00 |
Daniel Khashabi
|
8b1b93947f
|
Minor fix.
|
2018-12-14 14:10:36 -05:00 |
Thomas Wolf
|
8809eb6c93
|
update readme with information on NVIDIA's apex
|
2018-12-14 16:59:39 +01:00 |
thomwolf
|
d821358884
|
update readme
|
2018-12-14 15:15:17 +01:00 |
thomwolf
|
087798b7fa
|
fix reloading model for evaluation in examples
|
2018-12-13 14:48:12 +01:00 |
thomwolf
|
0f544625f4
|
fix swag example to work with apex
|
2018-12-13 13:35:59 +01:00 |
thomwolf
|
4946c2c500
|
run_swag example in readme
|
2018-12-13 13:02:07 +01:00 |
Thomas Wolf
|
91aab2a6d3
|
Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex
Change to use apex for better fp16 and multi-gpu support
|
2018-12-13 12:32:37 +01:00 |
Thomas Wolf
|
ffe9075f48
|
Merge pull request #96 from rodgzilla/multiple-choice-code
BertForMultipleChoice and Swag dataset example.
|
2018-12-13 12:05:11 +01:00 |
Grégory Châtel
|
dcb50eaa4b
|
Swag example readme section update with gradient accumulation run.
|
2018-12-12 18:17:46 +01:00 |
Deyu Fu
|
c8ea286048
|
change to apex for better fp16 and multi-gpu support
|
2018-12-11 17:13:58 -08:00 |
Thomas Wolf
|
a3a3180c86
|
Bump up requirements to Python 3.6
|
2018-12-11 11:29:45 +01:00 |
Grégory Châtel
|
0876b77f7f
|
Change to the README file to add SWAG results.
|
2018-12-10 15:34:19 +01:00 |
Davide Fiocco
|
c9f67e037c
|
Adding --do_lower_case for all uncased BERTs
I had missed those, it should make sense to use them
|
2018-12-07 20:40:56 +01:00 |
Grégory Châtel
|
150f3cd9fa
|
Few typos in README.md
|
2018-12-06 19:22:07 +01:00 |
Grégory Châtel
|
4fa7892d64
|
Wrong line number link to modeling file.
|
2018-12-06 19:18:29 +01:00 |
Grégory Châtel
|
6a26e19ea3
|
Updating README.md with SWAG example information.
|
2018-12-06 19:15:08 +01:00 |
Grégory Châtel
|
0a7c8bdcac
|
Fixing badly formatted links.
|
2018-12-04 13:43:56 +01:00 |
Grégory Châtel
|
3113e967db
|
Adding links to examples files.
|
2018-12-04 13:40:38 +01:00 |
Davide Fiocco
|
8a8aa59d8c
|
Update finetuning example adding --do_lower_case
Should be consistent with the fact that an uncased model is used
|
2018-12-01 01:00:05 +01:00 |
thomwolf
|
f9f3bdd60b
|
update readme
|
2018-11-30 23:05:18 +01:00 |
thomwolf
|
52ff0590ff
|
tup => tpu
|
2018-11-30 23:01:10 +01:00 |
thomwolf
|
296f006132
|
added BertForTokenClassification model
|
2018-11-30 13:56:53 +01:00 |
thomwolf
|
298107fed7
|
Added new bert models
|
2018-11-30 13:56:02 +01:00 |
Davide Fiocco
|
ec2c339b53
|
Updated quick-start example with `BertForMaskedLM`
As `convert_ids_to_tokens` returns a list, the code in the README currently throws an `AssertionError`, so I propose a quick fix.
|
2018-11-28 14:53:46 +01:00 |
thomwolf
|
05053d163c
|
update cache_dir in readme and examples
|
2018-11-26 10:45:13 +01:00 |