Wikipedia text corpus for self-supervised NLP model training
Updated Jul 17, 2022 - Python
Master's thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
A simple Ruby GUI desktop application that provides typical NLP tasks for English or German text files. Available for macOS, Windows and Linux.
DERBI (DEutscher RegelBasierter Inflektor) is a simple rule-based automatic inflection model for German based on spaCy. Applicable regardless of POS!
Ph.D. Thesis
Future time reference classification in English, Dutch, and German
German Categorized Wordlist Project
Web scraper written with Scrapy to extract user reviews in German of organic and fair trade coffee brands
Text Normalization on Learner Texts (South Tyrolean German as an L2)
Skills classifier assignment challenge
Sentiment analysis benchmark: Classical ML (TF-IDF/SVM) vs Fine-tuned BERT vs GPT-4o LLM on German & English customer feedback. Streamlit dashboard, FastAPI, 39 tests, CI/CD.
202 Hours German Gaming Real-world Casual Conversation and Monologue speech dataset