Wiktionary dump file parser and multilingual data extractor
-
Updated
Sep 15, 2025 - Python
Wiktionary dump file parser and multilingual data extractor
Discover the most comprehensive dictionaries built on Wiktionary. Universal, multilingual & monolingual—bimonthly updates, 180+ languages supported.
The last online dictionary CLI framework you need.
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBook.
A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format
Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.
Extract data from German Wiktionary XML files.
Anki add-on to look up vocabulary using Wiktionary
Web front end for WikDict dictionaries
Anki add-on to view and extract info from ZIM files
Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)
Language files for WordDumb
A program for creating a searchable local language dictionary based (mainly) on dumped wiktionary data. Allows user to collect definitions which can be exported as a machine readable flashcard file. Currently supports Latin, Ancient Greek and Old English.
Scrapes Wiktionary to find cognates
lookup words and pronunciations in Wiktionary
German IPA dictionary as extracted from wiktionary
Add a description, image, and links to the wiktionary topic page so that developers can more easily learn about it.
To associate your repository with the wiktionary topic, visit your repo's landing page and select "manage topics."