Main Page: Difference between revisions

From EDRDG Wiki
Jump to navigation Jump to search
No edit summary
 
No edit summary
Line 1: Line 1:
<big>'''MediaWiki has been successfully installed.'''</big>
==Electronic Dictionary Research and Development Group==


Consult the [http://meta.wikimedia.org/wiki/Help:Contents User's Guide] for information on using the wiki software.
Welcome to the Wiki of the [[About EDRDG |  Electronic Dictionary Research and Development Group]]. The Wiki is being developed as a repository of information and documentation about the Group's projects.  


== Getting started ==
==The JMdict/EDICT Project==
* [http://www.mediawiki.org/wiki/Manual:Configuration_settings Configuration settings list]
 
* [http://www.mediawiki.org/wiki/Manual:FAQ MediaWiki FAQ]
This project is to build a freely-usable general Japanese dictionary file. It began in 1991 with the EDICT Japanese-English file in a simple format, and in 1999 expanded into the XML-format JMdict file
* [https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce MediaWiki release mailing list]
Some useful links are:
 
*the [http://www.csse.monash.edu.au/~jwb/j_jmdict.html overview documentation of the JMdict file]
*the [http://www.csse.monash.edu.au/~jwb/edict.html overview documentation of the EDICT file]
*the main [http://www.csse.monash.edu.au/~jwb/edict_doc.html documentation of the JMdict/EDICT dictionary files]
*the [http://www.csse.monash.edu.au/~jwb/edrdg/licence.html licence statement for use of the projects' files]. This licence also applies to the contents of this Wiki.
*lists of [[JMdictEDICT_software|packages and servers]] using the JMdict/EDICT files
 
===Editorial Policy/Guidelines===
 
A page containing the [[editorial policy]] and guidelines for the JMdict/EDICT files is being developed.
 
===Entries Under Development===
 
There is an [[Entries Under Development]] page, where people can place incomplete words and phrases for later filling out to become full entries.
 
==The Tanaka Corpus==
 
This project is to maintain and extend the [http://www.edrdg.org/projects/tanaka/tanakacorpus.html Tanaka Corpus] which is a large collection of parallel Japanese/English sentence pairs.
 
==The KANJIDIC Project==
 
The KANJIDIC project has compiled files of data on kanji used in Japanese text processing. The files
cover the kanji in three Japanese standards:
* JIS X 0208-1998, which includes 6,355 kanji.
* JIS X 0212-1990, which includes extra 5,801 kanji
* JIS X 0213-2004, which extends JIS X 0208, overlaps with some of JIS X 0212, and adds 884 extra kanji.
 
Three data files are distributed by this project:
* the KANJIDIC2 file, which is in XML format, and contains all the kanji. For this file the following information is available:
** a project [http://www.csse.monash.edu.au/~jwb/kanjidic2/ overview page]
** a file [http://www.csse.monash.edu.au/~jwb/kanjidic2/kanjidic2_ov.html overview]
** the [http://www.csse.monash.edu.au/~jwb/kanjidic2/kanjidic2_dtdh.html DTD]
** a [http://www.csse.monash.edu.au/~jwb/kanjidic2/kd2examph.html sample entry]
* the KANJIDIC file, which covers the 6,355 kanji in JIS X 0208. For this there is the
** [http://www.csse.monash.edu.au/~jwb/kanjidic2/ overview page]
** [http://www.csse.monash.edu.au/~jwb/kanjidic_doc.html original documentation]
* the KANJD212 file, which covers the 5,801 kanji in JIS X 0212. for this there is the
** [http://www.csse.monash.edu.au/~jwb/kanjd212_doc.html original documentation]
* [http://www.pokerenfrancais.eu/modes-de-paiement poker virtuel]
 
==The COMPDIC Project==
 
The COMPDIC project compiled a glossary of terms used in the computing and telecommunications industries. It is in the "EDICT" format. See the [http://ftp.monash.edu.au/pub/nihongo/compdic_doc.html brief documentation].
 
==The ENAMDICT/JMnedict Project==
 
The ENAMDICT file contains about 720,000 proper names in Japanese. It is in EDICT format, with some special tags to indicate the type of proper name. It is also available in XML format as the Japanese-Multilingual named entity dictionary (JMnedict).There is a basic [http://www.csse.monash.edu.au/~jwb/enamdict_doc.html documentation page].
 
==The KRADFILE/RADKFILE Project==
 
This project provides a decomposition of kanji into a number of visual elements or radicals to support software which provides a lookup service using kanji components.
 
There is an [http://www.csse.monash.edu.au/~jwb/kradinf.html information page] about the files.
 
 
==The WWWJDIC Dictionary Server==
 
* [http://www.csse.monash.edu.au/~jwb/cgi-bin/wwwjdic.cgi?1C home page] of the server (at Monash).
 
* [http://www.csse.monash.edu.au/~jwb/wwwjdicinf.html User's Guide]
 
* [[WWWJDIC in Japanese]] project
 
* [[Common words]] - the 850 common words from Ogden's list. To be used to enhance English-Japanese lookups.
 
==Mailing List==
 
There is a [http://tech.groups.yahoo.com/group/edict-jmdict/ mailing list] for people engaged in the EDRDG projects.
 
==How Can I Help?==
 
From time to time people ask how they can best contribute to the projects. There are many ways of assisting, the main ones being:
 
* adding to and enhancing the main (EDICT/JMdict) dictionary file. This is best done by using the [http://www.csse.monash.edu.au/~jwb/cgi-bin/wwwjdic.cgi?17 New Entry/Amendment] page of WWWJDIC.
 
* adding extra Japanese-English sentence pairs to the collection based on the Tanaka Corpus. There is a [http://www.csse.monash.edu.au/~jwb/cgi-bin/wwwjdic.cgi?14 New Examples] function in WWWJDIC for this.
 
* assisting with the translation of the WWWJDIC interface into other languages. At present the priority is to make it fully available in Japanese. See the [[WWWJDIC in Japanese]] page.
 
* work through the lists of words Paul Blay has place on the [[Talk:Tanaka_Corpus]] page, which could become new dictionary entries.
 
* join and participate in the [http://tech.groups.yahoo.com/group/edict-jmdict/ mailing list] for people engaged in the EDRDG projects.

Revision as of 20:44, 18 March 2010

Electronic Dictionary Research and Development Group

Welcome to the Wiki of the Electronic Dictionary Research and Development Group. The Wiki is being developed as a repository of information and documentation about the Group's projects.

The JMdict/EDICT Project

This project is to build a freely-usable general Japanese dictionary file. It began in 1991 with the EDICT Japanese-English file in a simple format, and in 1999 expanded into the XML-format JMdict file Some useful links are:

Editorial Policy/Guidelines

A page containing the editorial policy and guidelines for the JMdict/EDICT files is being developed.

Entries Under Development

There is an Entries Under Development page, where people can place incomplete words and phrases for later filling out to become full entries.

The Tanaka Corpus

This project is to maintain and extend the Tanaka Corpus which is a large collection of parallel Japanese/English sentence pairs.

The KANJIDIC Project

The KANJIDIC project has compiled files of data on kanji used in Japanese text processing. The files cover the kanji in three Japanese standards:

  • JIS X 0208-1998, which includes 6,355 kanji.
  • JIS X 0212-1990, which includes extra 5,801 kanji
  • JIS X 0213-2004, which extends JIS X 0208, overlaps with some of JIS X 0212, and adds 884 extra kanji.

Three data files are distributed by this project:

The COMPDIC Project

The COMPDIC project compiled a glossary of terms used in the computing and telecommunications industries. It is in the "EDICT" format. See the brief documentation.

The ENAMDICT/JMnedict Project

The ENAMDICT file contains about 720,000 proper names in Japanese. It is in EDICT format, with some special tags to indicate the type of proper name. It is also available in XML format as the Japanese-Multilingual named entity dictionary (JMnedict).There is a basic documentation page.

The KRADFILE/RADKFILE Project

This project provides a decomposition of kanji into a number of visual elements or radicals to support software which provides a lookup service using kanji components.

There is an information page about the files.


The WWWJDIC Dictionary Server

  • Common words - the 850 common words from Ogden's list. To be used to enhance English-Japanese lookups.

Mailing List

There is a mailing list for people engaged in the EDRDG projects.

How Can I Help?

From time to time people ask how they can best contribute to the projects. There are many ways of assisting, the main ones being:

  • adding to and enhancing the main (EDICT/JMdict) dictionary file. This is best done by using the New Entry/Amendment page of WWWJDIC.
  • adding extra Japanese-English sentence pairs to the collection based on the Tanaka Corpus. There is a New Examples function in WWWJDIC for this.
  • assisting with the translation of the WWWJDIC interface into other languages. At present the priority is to make it fully available in Japanese. See the WWWJDIC in Japanese page.
  • work through the lists of words Paul Blay has place on the Talk:Tanaka_Corpus page, which could become new dictionary entries.
  • join and participate in the mailing list for people engaged in the EDRDG projects.