Github Repositories Trend

niderhoff/nlp-datasets

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Total stars
3,990
Stars per day
3
Created at
4 years ago
Related Repositories
awesome-information-retrieval
A curated list of awesome information retrieval resources
gensim-data
Data repository for pretrained NLP models and NLP corpora.
awesome-spanish-nlp
Curated list of Linguistic Resources for doing NLP & CL on Spanish
awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone.
awesome-project-ideas
Curated list of Machine Learning, NLP, Vision Project Ideas
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
MovieTweetings
A Live Movie Rating Dataset Collected From Twitter
data
Open Data Sources
nlp-datasets
A list of datasets/corpora for NLP tasks, in reverse chronological order.


harpribot/awesome-information-retrieval

A curated list of awesome information retrieval resources
Total stars
565
Related Repositories
Link

RaRe-Technologies/gensim-data

Data repository for pretrained NLP models and NLP corpora.
Homepage
https://rare-technologies.com/new-api-for-pretrained-nlp-models-and-datasets-in-gensim/
Total stars
506
Language
Python
Related Repositories
Link

dav009/awesome-spanish-nlp

Curated list of Linguistic Resources for doing NLP & CL on Spanish
Total stars
248
Related Repositories
Link

caesar0301/awesome-public-datasets

An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone.
Total stars
39,454
Related Repositories
Link

NirantK/awesome-project-ideas

Curated list of Machine Learning, NLP, Vision Project Ideas
Homepage
http://www.nirantk.in/awesome-project-ideas/
Total stars
3,841
Related Repositories
Link

juand-r/entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Total stars
656
Language
Python
Related Repositories
Link

sebastianruder/NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Total stars
15,798
Related Repositories
Link

sidooms/MovieTweetings

A Live Movie Rating Dataset Collected From Twitter
Total stars
287
Related Repositories
Link

datasciencemasters/data

Open Data Sources
Total stars
380
Related Repositories
Link

karthikncode/nlp-datasets

A list of datasets/corpora for NLP tasks, in reverse chronological order.
Total stars
872
Related Repositories
Link