Namazu-devel-en(old)
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Indexing pdf files with japanese characters...
"Navneet Saraogi" wrote:
> Now I'm able to create indexes on pdf files & search for english
> characters... The problem is that I'm unable to search for japanese
> characters in a pdf file... I ran the pdftotext command just to check
> the validity of the converter... Itz not converting pdf with japanese
> characters properly...
Use "-enc EUC-JP" command line option for pdftotext.
To index japanese text file, you need to run mknmz with
"--indexing-lang=ja" option.
See tutorial.
http://www.namazu.org/doc/tutorial.html.en#japanese
Yukio USUDA