Questions asked in the Lok Sabha - collection and analysis of trends. Creating the dataset from scratch.
-
Updated
Apr 21, 2022 - Jupyter Notebook
Questions asked in the Lok Sabha - collection and analysis of trends. Creating the dataset from scratch.
This project focuses on the analysis of text generation models such as GPT-2 to identify and understand populistic behaviors or biases against various nationality.
Implementation of the semi-structured inference model in our ACL 2020 paper. INFOTABS: Inference on Tables as Semi-structured Data
My project storage in NLP
Natural language processing including Datasets,Farsi NLP, Automated Essay Scoring, Automatic Speech Recognition and etc.
Experiments with word2vec embeddings for synonyms detection, for the Romanian language.
I tried to figure out positive and negative comments on my Youtube videos. So, I used NLP to analyze comments. I set the main language as Korean, but you can try setting English as the main language.
Hub for the Portuguese language NLP Resources
Repository for the paper STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions (EMNLP 2024)
This project is a sentiment analysis model built to classify IMDB movie reviews as either positive or negative using the **IMDB dataset**. It uses various machine learning models and deep learning techniques to handle the text data.
Python program for detecting unintentional bilingual and translation instances in NLP datasets.
Extract Abstract and Title Dataset from arXiv articles
Repo for Turkish sentiment analysis dataset, "Vitamins and Supplements Customer Reviews"
10 languages are classified using the stopwords included in the nltk library.
Add a description, image, and links to the nlp-datasets topic page so that developers can more easily learn about it.
To associate your repository with the nlp-datasets topic, visit your repo's landing page and select "manage topics."