The JMdictDB Project

News

New 2021-12-05:

The JMdictDB web backend now uses WSGI and the Flask web framework rather than CGI. The installation docs are still being revised but some instructions for upgrading an existing site are available here: http://www.edrdg.org/~smg/doc/2021-11-update.html November 2021 WSGI Upgrade.

The CGI backend remains available but is deprecated and will be removed in the future.

New 2020-06-23:

A simpler installation procedure has been implemented that allows the JMdictDB software to be installed and configured for per-user use or system-wide. Once installed, the installed version is completely independent of the Git repository it was installed from allowing the latter to be removed, used for development, etc. without affecting the former. The installation documentation was rewritten and simplified.

In support of this, the directory structure of the source code has been completely reorganized.

A number of internal refactorings have been made in order to make the code more maintainable.

The changes are too numerous to list here in full, please see:

Changes: doc/2020-06-changes.html
Update procedure: doc/2020-06-update.html

New 2019-03-10: Two new features:

The advanced search page now allows searching for entries by the contents of their sense note or history comments and references. You can search for a text string or regular expression match.
When entries are displayed, URLs in the history comments and refs fields are now shown as clickable hyperlinks. Please note though that the URL detection code is imperfect and because of browser quirks, many URLs were split across multiple lines resulting in only the initial part of the URL being converted to a link. For these, you will still need to copy-paste the full URL into your browser to view the page.

New 2018-09-15: The JMdictDB source code is now using Git for version management instead of Mercurial and the primary repository is now at GitLab:

https://gitlab.com/yamagoya/jmdictdb

The former JMdictDB Issue tracker based on Roundup has also been discontinued and replaced by the Issues feature of GitLab:

https://gitlab.com/yamagoya/jmdictdb/issues

Only issues tagged as "Bug" were migrated to GitLab; a decision on where and how to provide access to the old development issues is still pending.

New 2018-06-10 (revised 2018-09-15): conj.py is a standalone Python program that uses the conjugation tables developed for the JMdictDB project to demonstrate how simple a table-based Japanese word conjugator can be when using this approach. It has been moved out of JMdictDB to a separate, independent (git) project:

https://gitlab.com/yamagoya/jconj

(Revised to remove GitHub URL; GitLab is currently the only supported location.)

Older news items...

Project Description

JMdictDB is an open source project that provides a Postgresql database, Python API and web CGI front-end for storing and maintaining Japanese / multilingual dictionary data, primarily in support of Jim Breen's Japanese-English dictionary projects including: JMdict, JMnedict, Kanjidic2 WWWJDIC and others. Jim runs these projects under the auspices of the Electronic Dictionary Research and Development Group EDRDG).

The goals of this project (in priority order) are:

To create a database to serve as a master repository for the information in the JMdict, EDICT, JMnedict, Examples, Kanjidic and other related files distributed by Jim Breen and the EDRDG.
To provide a web-based system for the submission, review, and approval of corrections and new entries to these data.
To provide freely available software to others who want to use or build upon, "JMdict in a database".
To provide an open-source replacement for the principal author's Microsoft Access based JMdict database. :-)

Discussion of this project takes place on the Google Groups Edict-JMdict mailing list (edict-jmdict+subscribe@googlegroups.com / https://groups.google.com/g/edict-jmdict/about / https://www.edrdg.org/jmdict_edict_list). Jim Breen maintains a web page describing the JMdict project's use of JMdictDB at http://www.edrdg.org/wiki/index.php/JMdictDB_Project.

The project code is still undergoing active development and no promises are made regarding stability or backward compatibility. However, it is currently in use as the primary repository for the JMdict project dictionary data and the web interface is in use for submitting new entries and corrections to existing entries in WWWJDIC.

All the code developed for this project is GPL'd and maintained in a publicly accessible Git repository (links below). Additional help is welcome; please post to the edict-jmdict mailing list, or email the current principal developer at the address at the bottom of this page.

The code currently consists of scripts to create and load JMdict (and related data such as the JMnedict "Japanese names" file, or the Tatoeba "examples" file) into a Postgresql database, some maintenance and other command line tools, and a set of CGI scripts to allow access and updating of the database using a web browser. The code is written in Python-3 (Python-2 is no longer supported) and is tested under Ubuntu Linux. More information on prerequisites is in the doc/install.html file. file.

Access

Access to the instance of JMdictDB running at Jim Breen's edrdg.org site is through either the Edit links at on words looked up at wwwjdic or directly at: Simple Search, Advanced Search, New Entry, Recent Updates.

Code and Documentation

jmdictdb -- JMdictDB project hosted at GitLab.
Issue tracker -- Issue tracker (at GitLab) for the JMdictDB project.
jmdictdb.tar.gz -- Download source code, latest development version (gzipped tar file).
install.html -- The INSTALL file, includes prerequisites and installation instructions.
schema.html -- Comprehensive description of the database schema (2008-11-12).
schema.png, schema.pdf -- Diagram of the database schema (200KB, 2008-11-12).

JMdictDB at edrdg.org

The following links provide details about the implementation and use of JMdictDB at Jim Breen's EDRDG (Electronic Dictionary Research and Development Group) website (http://edrdg.org), for receiving new entry and correction suggestions for wwwjdict:

http://www.edrdg.org/wiki/index.php/Main_Page
http://www.edrdg.org/wiki/index.php/JMdictDB_Project
http://www.edrdg.org/wiki/index.php/Editorial_policy
http://nihongo.monash.edu/edictredev/index.html (some historical info from early in the development process)