nlp-datasets

Star

Here are 154 public repositories matching this topic...

sammitjain / loksabha-questions

Star

Questions asked in the Lok Sabha - collection and analysis of trends. Creating the dataset from scratch.

dataset india government-data public-policy nlp-machine-learning nlp-datasets

Updated Apr 21, 2022
Jupyter Notebook

PranavNV / Nationality-Prejudice-in-Text-Generation

Star

This project focuses on the analysis of text generation models such as GPT-2 to identify and understand populistic behaviors or biases against various nationality.

social-infomatics nlp-datasets ethics-in-ai

Updated Mar 14, 2023

SemiringInc / Mueller-Report-Corpus

Star

The Mueller Report Corpus V 0.1

nlp corpus corpus-linguistics nlp-datasets

Updated May 12, 2020

vgupta123 / infotabs-code

Star

Implementation of the semi-structured inference model in our ACL 2020 paper. INFOTABS: Inference on Tables as Semi-structured Data

nlp wikipedia svm transformers transformer tables snli mnli nli nlp-datasets roberta acl2020

Updated May 7, 2020
Python

BrunoGianetti / MyNLPProjects

Star

My project storage in NLP

Updated Feb 15, 2024
Jupyter Notebook

Shayokh144 / Bengali-Literature-Data-Collection

Star

nlp-datasets bengali-dataset

Updated Aug 11, 2020

Robin1999Stark / Recipe_Tagger

Star

NLP Project for Auto Labeling Receipes

python nlp ai pipeline tags tag languages tagger token mit-license ner piplines nlp-datasets alogirthm tokeniz augsburg-university

Updated Feb 28, 2024
Python

ArmanBehnam / NLP

Star

Natural language processing including Datasets,Farsi NLP, Automated Essay Scoring, Automatic Speech Recognition and etc.

nlp natural-language-processing tutorial language-modeling dataset persian natural-language-generation nlp-resources language-model farsi natural-language-inference nlp-machine-learning nlp-datasets

Updated Oct 14, 2020
Jupyter Notebook

claudiu1989 / Synonyms-detection

Star

Experiments with word2vec embeddings for synonyms detection, for the Romanian language.

nlp embeddings romanian nlp-resources nlp-machine-learning nlp-datasets

Updated Sep 10, 2023
Python

Kevinlee49 / analysis-youtube-comment-krisandme

Star

I tried to figure out positive and negative comments on my Youtube videos. So, I used NLP to analyze comments. I set the main language as Korean, but you can try setting English as the main language.

nlp nlp-machine-learning nlp-datasets

Updated Aug 23, 2021
Jupyter Notebook

LIAAD / PT-Pump-Up

Star

Hub for the Portuguese language NLP Resources

nlp natural-language-processing resources nlp-resources portuguese-language nlp-datasets

Updated Apr 18, 2024
PHP

Robert-Morabito / STOP

Star

Repository for the paper STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions (EMNLP 2024)

acl nlp-resources implicit-bias nlp-datasets bias-detection explicit-bias large-language-models emnlp2024

Updated Sep 24, 2024
Python

Blacksujit / Sentiment-Analysis

Sponsor

Star

This project is a sentiment analysis model built to classify IMDB movie reviews as either positive or negative using the **IMDB dataset**. It uses various machine learning models and deep learning techniques to handle the text data.