Namazu-devel-en(old)



Re: Indexing pdf files with japanese characters...



"Navneet Saraogi"  wrote:
> Now I'm able to create indexes on PDF files and search for English
> characters. The problem is that I'm unable to search for Japanese
> characters in a PDF file. I ran the pdftotext command just to check
> the validity of the converter. It's not converting PDFs with Japanese
> characters properly.

Use the "-enc EUC-JP" command-line option for pdftotext.

To index Japanese text files, you need to run mknmz with the
"--indexing-lang=ja" option.
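
For illustration, the two options above could be combined roughly as
follows. This is only a sketch: the function names and all file and
directory names are hypothetical, "-O" is mknmz's option for choosing the
index output directory, and pdftotext may need its Japanese encoding
support installed before "-enc EUC-JP" works.

```shell
#!/bin/sh
# Sketch only: function names and paths are hypothetical, not from the original post.

# Convert one PDF to plain text encoded as EUC-JP.
pdf_to_eucjp() {
    # $1 = input PDF, $2 = output text file
    pdftotext -enc EUC-JP "$1" "$2"
}

# Build a Namazu index with Japanese-specific processing enabled.
index_ja() {
    # $1 = directory that receives the index, $2 = directory to index
    mknmz --indexing-lang=ja -O "$1" "$2"
}

# Example usage (uncomment and adjust the paths):
# pdf_to_eucjp manual.pdf docs/manual.txt
# index_ja /path/to/index docs
```

Running pdf_to_eucjp by hand first is a quick way to check that the
converter produces readable Japanese text before building the index.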

See the tutorial:
http://www.namazu.org/doc/tutorial.html.en#japanese

Yukio USUDA