Name: Md Nishat Raihan
Type: User
Company: George Mason University
Bio: I'm a 3rd year CS Ph.D. Student at George Mason University. My primary research interest is NLP but I have also worked with Mining and Vision algorithms.
Twitter: MdNishatRaihan
Location: Fairfax, Virginia
Blog: https://md-nishat.com/
Md Nishat Raihan's Projects
A curated list of language modeling researches for code and related datasets.
A curated list of awesome Mojo 🔥 frameworks, libraries, software and resources
This is a dataset for the offensive language detection task. It contains 100k code mixed data. The languages are Bangla-English-Hindi.
This is a dataset for the sentiment analysis task. It contains 100k code mixed data. The languages are Bangla-English-Hindi.
Apps built using Inspired Cognition's Critique.
Benchmark Dataset for Code LLMs, Consisting of Academic Coding Prompts
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia text, a romanization lexicon of words in the native script with attested romanizations, and some full sentence parallel data in both a native script of the
14 million, semi-supervised, mental disorder detection data.
This is a dataset for the offensive language detection task. It contains 1k natural code mixed data. The languages are Bangla-English-Hindi.
This is a dataset for the sentiment analysis task. It contains 1k natural code mixed data. The languages are Bangla-English-Hindi.
A Transliterated Bangla Offensive Language Identification Dataset