Welcome to the amazing world of Wombat-Japanese
Hello, everyone. |
software: NJ-Star
function: Japanese for DOS.
comment: This is a sophisiticated DOS word processor for
any IBM clone that allows for viewing and editiing of documents in all
major Japanese encoding systems, including SJIS, New JIS, NEC JIS, and
EUC. It allows for input in kana, romaji, and even pinyin. However. it is
a bit more combersome than input using a front-end editor like ATOK as
Kanji must be selected using a numeral rather than simply the space bar.
It's default mode is EUC, so most people will have to run the configuration
program to get it the way they want it. It includes a
Kanji dictionary that allows people learning Japanese to instantly look
up Kanji they don't know. Instalation is easy, as they four zip files,
which take up about 1.6MB, can be unzipped in a single directory and run
with no further ado. Unzipped, the program is about 3MB.
The shareware version is extremely useful,
although if someone wants to do high quality printing they should pay
$89 for the registered version that includes a manual and print fonts.
You will need the following text file and four zip files:
ftp://ftp.cc.monash.edu.au/pub/nihongo/njstar31.txt
ftp://ftp.cc.monash.edu.au/pub/nihongo/njj310a.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/njj310b.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/njj310c.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/njj310d.zip
software: NJ-Star for
function: Japanese for DOS.
comment: This word processor for Windows 95 has lots of capability and
frills, but unfortunately still uses a combersome kanji input system.
Kanji must be selected using a numeral rather than simply the space bar.
The zip file is about 3MB.
ftp://ftp.cc.monash.edu.au/pub/nihongo/njjwp420.exe
ftp://ftp.cc.monash.edu.au/pub/nihongo/njjwp420.txt
software: NJWin
function: CJK view for Windows
comment: This is a solid viewer with automatic detection of the various
Japanese codes, conversion between traditional and simplified Chinese
characters, and also has Korean capability, but no input system.
Also, I found occasional corrupt characters at the begining or end of a
line when using it with Microsoft Explorer, although I assume this is due
to the fact that the browser unknowingly split the two-bit Kanji and that
I would get better results using a CJK version of the browser.
ftp://ftp.cc.monash.edu.au/pub/nihongo/njwin160.exe
ftp://ftp.cc.monash.edu.au/pub/nihongo/njwin160.txt
software: MOKE (Mark's Own Kanji Editor
function: Japanese for DOS.
comment:
MOKE's is a text editor and not a word processor. It is less sophisticated
than NJStar, but may work on more primative systems, requiring only a 386,
MS-DOS 3.1, and a 2HD floppy drive at minimum. This version was released in
1990, although it has since transformed itself into commercial software.
Instalation is a bit tricky, as some files have to be placed in a seperate
directory. Wombat is providing two batch file that may be of help,
one for minimal (floppy-based) installation and one for full instalation.
It only seems to handle 7-bit encoding systems (i.e. JIS and EUC but not
SJIS), but includes a conversion program. A slightly better conversion
program is provided from Wombat below. The seven
zip files, which take up about 900KB and unzipped the program is about 2MB.
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11bgi.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11doc.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11exe.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11k16.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11skk.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11wnnd.zip
ftp://ftp.cc.monash.edu.au/pub/nihongo/mk11wsk.zip
The following batch files may help in installation:
http://www.pitt.edu/~ctnst3/cjk/mokehard.bat
http://www.pitt.edu/~ctnst3/cjk/mokeflop.bat
You will need the following two files:
ftp://cnd.org/pub/software/ms-win/c-sys/uwdbm4_e.exe
ftp://cnd.org/pub/software/ms-win/input/uwhk.zip
You will need to press here for UnionWay product ID in order to activate
the
software.
Directory : ftp://cnd.org/pub/software/ms-win/c-sys
Help file :
ftp://cnd.org/pub/software/ms-win/c-sys/uwdbm4_c.exe
software: Twin Bridge
function: CJK viewer for Windows
comment: TwinBridge is a package that adds CJK functionality to non-CJK
Windows. A demo version of TwinBridge for Japanese is at the following
URL:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/windows/twnbdemo.txt
software: MView
version: 1.00, 1995.7.12
function: Super-Software MView is a Microsoft Windows
integrated
software that allows viewing of Chinese, Japanese and Korean characters
under
Windows. The supported coding types include GB, HZ, BIG5, JIS, EUC, SJIS,
KSC,
UTF7 and UTF8. MView is distributed as a shareware. It is an inexpensive
software with some distinctive features that are not found in other
softwares.
MView has been tested on Netscape 1.1n, Netscape 1.2b2, Windows Notepad,
Windows Write and Norton Desktop Editor.
author: Albert Chong
comment: Mview is a smaller program than Uniway and very useful as
a
viewer, but it does not seem to run with as many applications.
You will need the zipped software file and the fonts for whichever
character
set you plan to use.
Software:
ftp://cnd.org/pub/software/ms-win/viewer/ss-mview.zip
Simplified Chinese font:
ftp://cnd.org/pub/software/fonts/gb/hbf/cclib.16
Traditional Chinese font:
ftp://cnd.org/pub/software/fonts/big5/hbf/chinese.16
Japanese font:
ftp://cnd.org/pub/software/fonts/misc/hbf/jis.16
Korean font:
ftp://cnd.org/pub/software/fonts/misc/hbf/ksc.16
Unified font:
ftp://cnd.org/pub/software/fonts/unicode/hbf/unihan16.hbf
Gunzipped set of all the fonts???:
ftp://cnd.org/pub/software/fonts/unicode/hbf/*16.bin.gz
software: MView
version: 1.00 Alt, 1995.7.16
function: IMPORTANT: This is an alternative version of MView
Version
1.00. This version uses only Unicode font files. Refer to mview_at.txt for
(7
bits), ISO Latin-1 (8 bits), Japanese, Chinese, Korean (16 bits) coded in
the
ISO2022 standard and its variants (e.g. EUC, Compound Text). For Chinese
there
is support for both GB and Big5. In addition, Arabic, IPA, Thai (based on
TIS620) and Vietnamese (based on VISCII and VSCII) are also supported.
author: Ken'ichi HANDA, Satoru TOMURA, Mikiko NISIKIMI
ftp://cnd.org/pub/software/mule/mule-2.3.tar.gz
ftp://cnd.org/pub/software/mule/diff-2.2.2-2.3.gz
ftp://cnd.org/pub/software/mule/diff-19.28-2.3.gz
software: Multi-Language Conversion System "Wnn"
version: 4.2
function: This packege includes the following software:
Japanese Conversion Server
Simplified Chinese Conversion Server
Traditional Chinese Conversion Server
Korean Conversion Server
Multi-Language Input Manager for X Window(R5/R6)
author: KUWARI Seiji
comment:I am not sure of the file name listed below, so you will
have to
look around the directory
ftp://cnd.org/pub/software/x-win/Wnn4.2.tar.gz
software: mule
version: 2.3 of 1995.7.24 (SUETSUMUHANA)
function: MULtilingual Enhancement to GNU Emacs 19.28 can handle
ASCII
(7 bits), ISO Latin-1 (8 bits), Japanese, Chinese, Korean (16 bits) coded
in
the ISO2022 standard and its variants (e.g. EUC, Compound Text). For
Chinese
there is support for both GB and Big5. In addition, Arabic, IPA, Thai
(based on
TIS620) and Vietnamese (based on VISCII and VSCII) are also supported.
authors: Ken'ichi HANDA, Satoru TOMURA, Mikiko NISIKIMI
ftp://cnd.org/pub/software/mule/mule-2.3.tar.gz
ftp://cnd.org/pub/software/mule/diff-2.2.2-2.3.gz
ftp://cnd.org/pub/software/mule/diff-19.28-2.3.gz
software: CJK
version: 3.0 beta 1, 1995.12.6
function: Enables Chinese/Japanese/Korean for use with LaTeX2e;
supports
GB, BIG5, CNS, JIS, KSC, and UTF8.
author: Werner Lemberg
ftp://cnd.org/pub/software/tex/CJK.3_0.1.tar.gz
Here are Ken Lunde's programs for converting, repairing, and printing JIS
codes compiled on a unix sun machine!
http://www.pitt.edu/~ctnst3/cjk/jis.exe
http://www.pitt.edu/~ctnst3/cjk/repair-jis.exe
http://www.pitt.edu/~ctnst3/cjk/print-jis.exe
Mr. Lunde was nice enough to make available to us his source codes, which
are
writen in "C". He has our gratitude!
http://www.pitt.edu/~ctnst3/cjk/jis.c
http://www.pitt.edu/~ctnst3/cjk/repair-jis.c
http://www.pitt.edu/~ctnst3/cjk/print-jis.c
The good people at Wombat have ported Mr. Lunde's algorythms to Turbo
Pascal and compiled versions that run on both MS-DOS for IBM and MS-DOS
for the NEC 9800 series.
http://www.pitt.edu/~ctnst3/cjk/wjc.pas
http://www.pitt.edu/~ctnst3/cjk/wjc.com (for NEC's)
http://www.pitt.edu/~ctnst3/cjk/wjc.exe (for IBM's)
Squeeze here for Macintosh software from CMU
This is an excellent and comprehensive online document that by Ken
Lunde
that provides information on CJK (that is,
Chinese, Japanese, and Korean) character set standards and encoding
systems. In short, it provides detailed information on how CJK text is
handled electronically. Mr. Lunde is happy to share this information with
others and would appreciate any comments/feedback on its
content. The current version (master copy) of this document is
maintained at:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/doc/cjk.inf
The following files contain the latest information about
changes (additions and corrections) made to UJIP for various
printings, both for those that have taken place (such as for the
second printing) and for those that are planned (the first digit is
the edition, and the second is the printing):
ftp://ftp.ora.com/pub/examples/nutshell/ujip/errata/ujip-errata-1-2.*
If you happen to be running X Windows, it is very easy to
display these CJK character sets (if a bitmapped font for the
character set exists, that is). Here is what I usually do:
ftp://cair-archive.kaist.ac.kr/pub/hangul/fonts/
Base64 encoding is most commonly used for encoding non-ASCII
text that appears in e-mail headers. Of all the portions of an e-mail
message, its header gets manipulated the most during transmission, and
Base64 encoding offers a safe way to further encode non-ASCII text so
that it is not altered by mail-routing software.
One typically does not need to worry about encoding text as
Base64 (MIME-compliant mailing software usually performs this task for
you). The problem is usually trying to decode Base64-encoded text. A
Base64 decoder is available in Perl at the following URL:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/perl/b64decode.pl
Most MIME-compliant e-mail software can decode Base64-encoded
text.
There are two types of conversions that can be performed. The
first type is converting between different encodings for the same
character set. This is usually without problems (but not always). The
second type is converting from one character set to another (it is not
usually relevant whether the underlying encoding has changed or not).
This usually involves the handling of characters that are in one
character set, but not the other. So, what to do?
I suggest JConv for handling Japanese code conversion (this
means converting between JIS, Shift-JIS, and EUC encodings). This is
in the category of different encodings for the same character set. The
following URLs provide executables or source code:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/mac/jconv-30.hqx
ftp://ftp.ora.com/pub/examples/nutshell/ujip/unix/
For software that handles Chinese code conversion (this
includes conversion to and from Japanese), I suggest browsing at the
following URLs:
ftp://ftp.ifcss.org/pub/software/dos/convert/
http://www.fujitsu.co.jp/
http://www.netscape.com/
Mule (multilingual enhancement to GNU Emacs), written by
Kenichi Handa (handa@etl.go.jp), is the first (and only?) CJK-capable
editor for UNIX systems, and is freely available under the terms of
the GNU General Public License. Mule was developed from Nemacs
(Nihongo Emacs).
Mule is available at the following URL:
ftp://etlport.etl.go.jp/pub/mule/
CNPRINT, developed by Yidao Cai (cai@neurophys.wisc.edu), is a
utility to print CJK text (or convert it to a PostScript file), and is
available for DOS, VMS, and UNIX systems. A wide range of encoding
methods are supported by CNPRINT.
CNPRINT is available at the following URLs:
ftp://ftp.ifcss.org/pub/software/{dos,unix,vms}/print/
Nisus Writer, written by Nisus Software and available for
Macintosh, is fully CJK-capable as long as you have the appropriate
scripts installed (such as CLK for Chinese or JLK for Japanese).
A"Language Key" is also required for Chinese and Korean (and some
one-byte scripts such as Arabic and Hebrew).
The CJK encodings that are supported by Nisus Writer are the
same as made available by the underlying Macintosh operating system.
No import/export of other encodings is supported. You must run
separate conversion utilities for both import and export.
A demo version of Nisus Writer is available at the following
URL:
ftp://ftp.nisus-soft.com/pub/nisus/demos/NisusWriterDemo.sea.hqx
info@nisus-soft.com
http://www.nus/
The following Usenet Newsgroups typically have postings with
information relevant to issues discussed in CJK.INF (in alphabetical
order):
alt.chinese.computing
alt.chinese.text (HZ encoding used for Chinese text)
alt.chinese.text.big5 (Big Five encoding used for Chinese text)
alt.japanese.text (JIS encoding used for Japanese text)
comp.software.international
comp.std.internat
fj.editor.mule (JIS encoding used for Japanese text)
fj.kanji (JIS encoding used for Japanese text)
sci.lang.japan (JIS encoding used for Japanese text)
If your local news host does not provide a feed of the fj.*
newsgroups (shame on them!), or if you do not have access to Usenet
News, you can alternatively fetch them from the following URL:
ftp://kuso.shef.ac.uk/pub/News/
Below are the URLs for useful FTP sites. The directory
specified is the recommended place from which to start poking around
for useful files.
ftp://cair-archive.kaist.ac.kr/pub/hangul/
A.2.2: USEFUL TELNET SITES
For those who have a NIFTY-Serve account, there is now a very
convenient way to access NIFTY-Serve using telnet. The URL is as
follows:
telnet://r2.niftyserve.or.jp/
telnet://compuserve.com/
You can also do a search on fj.* newsgroup articles at the
following URL:
telnet://asahi-net.or.jp/
A.2.4: USEFUL WWW SITES
Because the World-Wide Web is a constantly changing place (and
more importantly, because I don't want to re-issue a new version of
this document every month!), I will maintain links to useful documents
at my WWW Home Page. Its URL is as follows:
http://jasper.ora.com/lunde/
Many RFCs (Request for Comments) are relevant to this
document. They are:
o RFC 1341: "MIME (Multipurpose Internet Mail Extensions): Mechanisms
for Specifying and Describing the Format of Internet Message
Bodies," by Nathaniel Borenstein and Ned Freed, June 1992.
o RFC 1468: "Japanese Character Encoding for Internet Messages," by
Jun Murai et al., June 1993.
o RFC 1554: "ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP," by
Masataka Ohta and Kenichi Handa, December 1993.
o RFC 1557: "Korean Character Encoding for Internet Messages," by
Uhhyung Choi et al., December 1993.
These RFCs can be obtained from FTP archives that contain all RFC
documents (such as URLs ftp://nic.ddn.mil/rfc/ or ftp://ftp.uu.net/
inet/rfc/), but these specific ones are mirrored at the following URL:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/Ch9/
A.3.5: FAQs
There are several FAQ (Frequently Asked Questions) files that
provide useful information. The following is a listing of a some along
with their URLs:
o "sci.lang.japan" FAQ by Rafael Santos (santos@mickey.ai.kyutech.ac.jp)
at:
http://www.mickey.ai.kyutech.ac.jp/japanese/
o "Programming for Internationalization" FAQ by Michael Gschwind
(mike@vlsivie.tuwien.ac.at) at:
ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/ISO-programming
o "Japanese Internet Service Providers" FAQ by Mike Collinson
(mike@uxp.bs2.mt.nec.co.jp) at
ftp://ftp.ora.com/pub/examples/nutshell/ujip/faq/japan-internet.FAQ
o "How to Use Japanese on the Internet with a PC: From Login to WWW"by
Hideki Hirayama (sgw01623@niftyserve.or.jp) at:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/faq/jpn-inet.FAQ
Press here to learn All About
Sports in Pittsburgh.
Press here to learn Miscelanious
Computer stuff, like Pascal and shell accounts.
Press here for an English text
book for Japanese speakers (a great place to test out Japanese
language
software!)
Press here to go back to Titus
North's Home Page, featuring All About CHON WOLSON, Links to Japanese
and Chinese language shareware, information about the International
Chindogu Society, and much much more.
Squeeze here for the University of
Pittsburgh's East Asian Library, which can be interesting at
times.
A corresponding README file that provides details of the demo (and
contact information) is at the following URL:
ftp://omron3.sp.cs.cmu.edu/pub/mac/japanese
This file may also be obtained by contacting Mr. Lunde directly using one
of
the e-mail addresses listed in the CONTACT INFORMATION section.
ftp://ftp.ora.com/pub/examples/nutshell/ujip/errata/ujip-errata-1-3.*
The asterisk is used for the file extension that indicates the
Japanese code used (jis, euc, or sjs) -- pick the one that is best
suited for your display or printing environment. I *highly* recommend
that all readers of UJIP obtain these errata files. Those without FTP
access can request copies directly from me.
o Obtain a BDF (Bitmap Distribution Format) font for the target
character set. Try the following URLs:
ftp://etlport.etl.go.jp/pub/mule/fonts/
ftp://ftp.ifcss.org/pub/software/fonts/{big5,cns,gb,misc,unicode}/bdf/
ftp://ftp.ora.com/pub/examples/nutshell/ujip/unix/
BDF files usually have the string "bdf" somewhere in their file
name. If the file is compressed (noticing that it ends in .gz or .Z
is a good indication), decompress it. BDF files are text files.
Note that this program takes "raw" Base64 data as input. Any non-
Base64 stuff must be stripped. I usually run this from within Mule
(C-u M-| b64decode.pl) after defining a region around the Base64-
encoded material.
4.5: CODE CONVERSION TIPS
ftp://ftp.ora.com/pub/examples/nutshell/ujip/mac/jconv-dd-181.hqx
ftp://ftp.ora.com/pub/examples/nutshell/ujip/dos/jconv.exe
ftp://ftp.ora.com/pub/examples/nutshell/ujip/src/jconv.c
There are other programs available that do the same basic thing as
JConv, such as kc and nkf. They are available at the following URL:
For source codes for useful programs by Ken Lunde that repair damaged JIS
files and other neat stuff, try the following:
ftp://ftp.ifcss.org/pub/software/mac/convert/
ftp://ftp.ifcss.org/pub/software/ms-win/convert/
ftp://ftp.ifcss.org/pub/software/unix/convert/
ftp://ftp.ifcss.org/pub/software/vms/convert/
ftp://ftp.ora.com/pub/examples/nutshell/ujip/map/
The latter URL mirrors some material made available from Koichi
Yasuoka (yasuoka@kudpc.kyoto-u.ac.jp).
When WWW documents first came online, there was no method for
handling CJK character sets. This has, fortunately, changed. As of
this writing, two commercial WWW browsers support Japanese. They are
Infomosaic by Fujitsu Limited, and Netscape Navigator by Netscape
Communications Corporation (Version 1.1 added Japanese support). Both
are graphics-based browsers. The former can be ordered at the
following URL:
The latter can be found at the following URLs:
ftp://ftp.netscape.com/
http://condor.stcloud.msus.edu:20020/netscape.html
I *highly* suggest reading it.
7.1: MULE
7.2: CNPRINT
ftp://neurophys.wisc.edu/[public.cn]/
7.3: NISUS WRITER
Give it a try! Updaters are also available at the same FTP site.
Nisus Software can be contacted using the following e-mail
address or through their WWW page:
The subdirectories correspond to the newsgroup name, but with the"dots"
being replaced by "slashes." For example, the "fj.binaries.mac"
newsgroup is archived in the "fj/binaries/mac" subdirectory. Many
thanks to Earl Kinmonth (jp1ek@sunc.shef.uc.uk) for this service.
A.2.1: USEFUL FTP SITES
ftp://etlport.etl.go.jp/pub/mule
ftp://ftp.adobe.com/pub/adobe/
ftp://ftp.cc.monash.edu.au/pub/nihongo/
ftp://ftp.ifcss.org/pub/software/
ftp://ftp.ifcss.org/pub/software/
ftp://ftp.ora.com/pub/examples/nutshell/ujip/
ftp://ftp.uwtc.washington.edu/pub/Japanese/
ftp://kuso.shef.ac.uk/pub/Japanese/
ftp://unicode.org/pub/
This list is expected to grow.
Another information service with a similar access mechanism is
CompuServe, whose URL is as follows:
You will need to press the return key to get the "Host Name:" prompt,
at which time you type "cis" (just follow the menus from this point
on).
You login as "fj-db" once you are connected.
A.3.4: RFCs
An RFC to cover Chinese is currently in the works, and will be
made available at the above URL as soon as it is ready.
Update announcements are usually posted to the sci.lang.japan
newsgroup.
Also posted to the comp.software.international newsgroup.
o "Internationalization Reference List" by Eugene Dorr
(gdorr@pgh.legent.com) at:
ftp://ftp.ora.com/pub/examples/nutshell/ujip/doc/i18n-books.txt
Note really a FAQ, but quite useful.