Skip to content

Learning word categories using pre-trained transformer embeddings

License

Notifications You must be signed in to change notification settings

glicerico/wordcat_transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

wordcat-transformer

Learning word categories using pre-trained transformer embeddings.

Simple example to run word categorizer:

python src/word_categorizer.py --sentences sentences/1sentence.txt 
                               --vocab vocabularies/microtest.vocab 
                               --clusterer OPTICS
                               --save_to microtest 
                               --pickle_emb microtest.pickle 
                               --masks 3 --verbose

For full options, see code's main function documentation

About

Learning word categories using pre-trained transformer embeddings

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published