Namazu-devel-en(old)
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Doubts
Navneet> 1. For Japanese documents Namazu stores indexes for the searched text in kana hence converts all Kanji's
Navneet> to kana. Therefore the search cannot differentiate between those two different Kanji charcaters. Is this
Navneet> true?
NO, the first assumption is NO. NMZ.* files stores two bytes charactors
as it is.
Navneet> 2. Namazu uses software called nkf for Japanese processing. NKF 1.71 supports the following encoding -
Navneet> 7-bit JIS, MS-kanji (Shift_JIS) or EUC.
Yes:
Navneet> It does not support UTF-8. I heard NKF 2.01 onwards supports UTF-8 but the recommended version of NKF for
Navneet> Namazu 2.0 is NKF 1.71???
Namazu does not support UTF-8 internally.
---
Makoto Fujiwara,
Chiba, Japan, Narita Airport and Disneyland prefecture.