Namazu-devel-en(old)
Re: [namazu] Truncated result , last char missing
- From: knok@xxxxxxxxxxxxx
 
- Date: Thu, 05 Feb 2004 11:16:09 +0900
 
- X-ml-name: namazu-devel-en
 
- X-mail-count: 00100
 
- References: <1075736609.17010.66.camel@xav>	<200402030042.FMLAAB27348.namazu-devel-en@namazu.org>	<1075902441.10950.50.camel@xav>
 
At Wed, 04 Feb 2004 14:47:22 +0100,
Xavier Grangier wrote:
> NMZ.field.subject 
> 
> I have:
> 
> ---------------------------------------------------
> ....
> vrp multicarte maillots de bains--00--
> recherche rédacteurs web--00-
> Recherche Hotesses et Hôtes--00-
> Recherche Maître Chien, Agents de sécurite ,Gardien...--00--
> Dévelopement d'interface d'un jeu de Rugby--00-
> Commercial publicité terrain--00-
> ....
> ---------------------------------------------------
A possible cause is the following lines in mknmz (in the put_field_index function):
            # Remove a garbage Kanji 1st char at the end.
            $output = codeconv::chomp_eucjp($output);
The function chomp_eucjp() is meant for Japanese text. Japanese
characters are encoded in multiple bytes, so the function trims an
incomplete multibyte character left at the end of the string. With
non-Japanese 8-bit text it may wrongly remove the last byte instead.
> Does anyone have notice/fix this issue ?
Could you try commenting out the above line? If my suspicion is
correct, that should fix the problem.
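
To illustrate the suspicion, here is a rough sketch in Python (not the
Perl used by mknmz). It assumes the subjects are stored as Latin-1 and
that the trimming drops the final byte whenever the string holds an odd
number of high (0x80-0xFF) bytes. That rule is only a guess that happens
to match the quoted sample; the name chomp_odd_high_bytes is
hypothetical and this is not the actual chomp_eucjp() source.

```python
def chomp_odd_high_bytes(data: bytes) -> bytes:
    """Hypothetical stand-in for a parity-based EUC-JP trim.

    In EUC-JP, a two-byte character uses two high bytes, so an odd
    count of high bytes suggests a character cut in half at the end.
    This is an assumed rule, not Namazu's real implementation.
    """
    high = sum(1 for b in data if b >= 0x80)
    if high % 2 == 1:
        return data[:-1]  # drop the final byte, whatever it is
    return data


# Subjects from the report, as they presumably looked before trimming.
subjects = [
    "vrp multicarte maillots de bains--00--",   # no accented letters
    "recherche rédacteurs web--00--",           # one accented letter
    "Recherche Maître Chien, Agents de sécurite ,Gardien...--00--",  # two
]

for s in subjects:
    raw = s.encode("latin-1")  # assumption: one byte per accented letter
    print(chomp_odd_high_bytes(raw).decode("latin-1"))
```

Under these assumptions, only the subject with a single accented letter
loses its final "-", which is the same pattern as in the quoted
NMZ.field.subject lines.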
-- 
NOKUBI Takatsugu
E-mail: knok@xxxxxxxxxxxxx
	knok@xxxxxxxxxx / knok@xxxxxxxxxx