Namazu-win32-users-ja (old)
Searching Excel files
- From: Hiroyuki Yamada <hiroyuki@xxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Thu, 22 Aug 2002 11:29:36 +0900
- X-ml-name: namazu-win32-users-ja
- X-mail-count: 01443
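This is Yamada from Kyoto.
Nice to meet you. I installed Namazu for Win32 and tested full-text search
of Excel files.
The mknmz command appears to build the index, but the namazu command
reports "No documents matched the search expression."
MS Word documents appear to be searched correctly.
Is it possible for Word files to be searchable while Excel files are not?
I would appreciate any advice.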
(Environment)
OS: Windows 98 SE
Excel 2002(10.2614.2625)
Word 2002(10.2627.2625)
The output of "perl nmzchk.pl > nmzchk.txt" is included below.
Content-type: text/plain
=== printout opendir(CURDIR,".") ===
name:nmzchk.pl
dev=2 ino=0 mode=33206 nlink=1
uid=0 gid=0 rdev=2 size=4879
atime=1029942000 mtime=973710388 ctime=1029814243
blksize= blocks=
name:NMZSETUP.BAT
dev=2 ino=0 mode=33279 nlink=1
uid=0 gid=0 rdev=2 size=14575
atime=1029942000 mtime=1006870680 ctime=1029814243
blksize= blocks=
name:nmzchk.txt
dev=2 ino=0 mode=33206 nlink=1
uid=0 gid=0 rdev=2 size=0
atime=1029942000 mtime=1029982268 ctime=1029910150
blksize= blocks=
=== printout $ENV ===
HOME = >>>C:\namazu<<<
ITAIJIDICTPATH = >>>c:\kakasi\share\kakasi\itaijidict<<<
KANWADICTPATH = >>>c:\kakasi\share\kakasi\kanwadict<<<
LANG = >>>ja_JP.SJIS<<<
MKNMZRC = >>>C:\namazu\etc\namazu\mknmzrc<<<
--- printout C:\namazu\etc\namazu\mknmzrc ---
package conf; # Don't remove this line!
$HTML_SUFFIX = "html?|[ps]html|html\\.[a-z]{2}";
$ALLOW_FILE = ".*\\.(?:$HTML_SUFFIX)|.*\\.txt" . # HTML, plain text
$DENY_FILE =
".*\\.(gif|png|jpg|jpeg)|.*\\.tar\\.gz|core|.*\\.bak|.*~|\\..*|\x23.*";
$DIRECTORY_INDEX = "";
$REMAIN_HEADER = "From|Date|Message-ID";
$SEARCH_FIELD =
"message-id|subject|from|date|uri|newsgroups|to|summary|size";
$META_TAGS = "keywords|description";
$FIELD_ALIASES = ('title' => 'subject', 'author' => 'from');
$NON_SEPARATION_ELEMENTS =
'A|TT|CODE|SAMP|KBD|VAR|B|STRONG|I|EM|CITE|FONT|U|'.
$ON_MEMORY_MAX = 5000000;
$FILE_SIZE_MAX = 2000000;
$TEXT_SIZE_MAX = 600000;
$WORD_LENG_MAX = 128;
$LIBDIR = 'C:/namazu/share/namazu/pl';
$FILTERDIR = 'C:/namazu/share/namazu/filter';
$TEMPLATEDIR = 'C:/namazu/share/namazu/template';
1;
-------------------------
NAMAZULOCALEDIR = >>>C:\namazu\share\locale<<<
NAMAZURC = >>>C:\namazu\etc\namazu\namazurc<<<
--- printout C:\namazu\etc\namazu\namazurc ---
Index C:\namazu\var\namazu\index
Lang ja_JP.SJIS
-------------------------
PATH = >>>C:\NAMAZU\BIN;C:\PERL\BIN\;C:\WINDOWS;C:\WINDOWS;C:\WINDOWS\COMMAND;C:\JDK1.3.1_04\BIN;C:\KAKASI\BIN;<<<
=== where ===
C:\NAMAZU\BIN/namazu.exe
C:\PERL\BIN\/perl.exe
C:\KAKASI\BIN/kakasi.exe
C:\NAMAZU\BIN/mknmz.bat
C:\NAMAZU\BIN/gcnmz.bat
C:\PERL\BIN\/ppm.bat
C:\PERL\BIN\/pl2bat.bat
=== versions ===
--- namazu -v ---
namazu of Namazu 2.0.10
Copyright (C) 1997-1999 Satoru Takabayashi All rights reserved.
Copyright (C) 2000,2001 Namazu Project All rights reserved.
This is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2, or (at your option)
any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty
of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
------------
--- namazu -C ---
Loaded configuration file: C:\namazu\etc\namazu\namazurc
--
Index directory (Index): C:\namazu\var\namazu\index
Log recording (Logging): on
Language used (Lang): ja_JP.SJIS
Score calculation (Scoring): tfidf
Template (Template):
Maximum number of hits (MaxHit): 10000
Maximum number of matching words (MaxMatch): 1000
Emphasis tags (EmphasisTags): <strong class="keyword"> </strong>
------------
--- perl -v ---
This is perl, v5.6.1 built for MSWin32-x86-multi-thread
(with 1 registered patch, see perl -V for more detail)
Copyright 1987-2001, Larry Wall
Binary build 630 provided by ActiveState Tool Corp.
http://www.ActiveState.com
Built 09:32:05 Nov 11 2001
Perl may be copied only under the terms of either the Artistic License or the
GNU General Public License, which may be found in the Perl 5 source kit.
Complete documentation for Perl, including FAQ lists, should be found on
this system using `man perl' or `perldoc perl'. If you have access to the
Internet, point your browser at http://www.perl.com/, the Perl Home Page.
------------
--- nkf -v ---
Bad command or file name
------------
--- kakasi -v ---
KAKASI - Kanji Kana Simple Inverter Version 2.3.4
Copyright (C) 1992-1999 Hironobu Takahashi. All rights reserved.
Usage: kakasi -a[jE] -j[aE] -g[ajE] -k[ajKH] -E[aj] -K[ajkH] -H[ajkK] -J[ajkKH]
-i{oldjis,newjis,dec,euc,sjis}
-o{oldjis,newjis,dec,euc,sjis}
-r{hepburn,kunrei} -p -s -f -c"chars" [jisyo1, jisyo2,,,]
Character Sets:
a: ascii j: jisroman g: graphic k: kana (j,k defined in jisx0201)
E: kigou K: katakana H: hiragana J: kanji(E,K,H,J defined in jisx0208)
Options:
-i: input coding system -o: output coding system
-r: romaji conversion system
-p: list all readings (with -J option)
-s: insert separate characters (with -J option)
-f: furigana mode (with -J option)
-c: skip chars within jukugo (with -J option: default TAB CR LF BLANK)
-C: romaji Capitalize (with -Ja or -Jj option)
-U: romaji Upcase (with -Ja or -Jj option)
-u: call fflush() after 1 character output
-w: wakatigaki mode
Report bugs to <bug-kakasi@xxxxxxxxxx>.
------------
--- mknmz -v ---
mknmz of Namazu 2.0.10
Copyright (C) 1997-1999 Satoru Takabayashi All rights reserved.
Copyright (C) 2000,2001 Namazu Project All rights reserved.
This is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2, or (at your option)
any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty
of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
------------
--- mknmz -C ---
Loaded configuration file: C:/namazu/etc/namazu/mknmzrc
System: MSWin32
Namazu: 2.0.10
Perl: 5.006001
NKF: module_nkf
KAKASI: module_kakasi -ieuc -oeuc -w
ChaSen: chasen -j -F '%m '
Word segmentation (wakati): module_kakasi -ieuc -oeuc -w
Message language: ja_JP.SJIS
Language: ja_JP.SJIS
Character encoding: sjis
CONFDIR: C:/namazu/etc/namazu
LIBDIR: C:/namazu/share/namazu/pl
FILTERDIR: C:/namazu/share/namazu/filter
TEMPLATEDIR: C:/namazu/share/namazu/template
Supported media types:
application/excel
application/ichitaro4
application/ichitaro5
application/ichitaro6
application/ichitaro7
application/msword
application/rtf
application/x-gzip
application/x-js-taro
message/news
message/rfc822
text/hnf
text/html
text/html; x-type=mhonarc
text/plain
text/plain; x-type=rfc
text/x-hdml
------------
--- zcat --version ---
Bad command or file name
------------
--- gzip --version ---
Bad command or file name
------------
--- groff --version ---
Bad command or file name
------------
--- jgroff --version ---
Bad command or file name
------------
--- pdftotext -v ---
Bad command or file name
------------
--- xlhtml ---
Bad command or file name
------------
--- wvhtml ---
Bad command or file name
------------
--- wvversion ---
Bad command or file name
------------
--- gcc --version ---
Bad command or file name
------------
--- make --version ---
Bad command or file name
------------
--- gettext --version ---
Bad command or file name
------------
--- autoconf --version ---
Bad command or file name
------------
--- automake --version ---
Bad command or file name
------------
--- libtool --version ---
Bad command or file name
------------
--- File::MMagic ---
1.13
--- NKF ---
1.92
--- Text::Kakasi ---
1.05
--- Text::Chasen ---
Not found Text::Chasen !!!
--- web server ---
HTTP/1.0 200 OK
Server: Microsoft-PWS-95/2.0
Date: Thu, 22 Aug 2002 02:11:18 GMT
Content-Type: text/html
Accept-Ranges: bytes
Last-Modified: Fri, 18 Oct 1996 02:00:00 GMT
Content-Length: 1181
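
For anyone reading the report above, the following is a minimal Python sketch that lists which external commands nmzchk could not run. It assumes the layout shown above: a "--- command ---" header line followed by the command's output. The function name missing_commands and the marker strings are my own choices for illustration and are not part of Namazu or nmzchk. It only summarizes the report and does not by itself explain the Excel search problem.

```python
import re

# Messages the Windows 9x shell prints for an unknown command
# (Japanese wording and its English equivalent). Assumed markers.
NOT_FOUND_MARKERS = (
    "コマンドまたはファイル名が違います",
    "Bad command or file name",
)

# Section headers in the report look like "--- xlhtml ---".
# Separator lines made only of dashes do not match this pattern.
HEADER = re.compile(r"^--- (.+?) ---$")


def missing_commands(report):
    """Return command names whose section contains a not-found marker."""
    missing = []
    current = None
    for raw in report.splitlines():
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            current = match.group(1)
        elif current and any(marker in line for marker in NOT_FOUND_MARKERS):
            # Keep only the command name, dropping flags such as "-v".
            missing.append(current.split()[0])
            current = None
    return missing


sample = """\
--- perl -v ---
This is perl, v5.6.1 built for MSWin32-x86-multi-thread
------------
--- nkf -v ---
コマンドまたはファイル名が違います.
------------
--- xlhtml ---
Bad command or file name
------------
"""
print(missing_commands(sample))  # ['nkf', 'xlhtml']
```

To run it against a saved report, read nmzchk.txt with the encoding it was written in and pass the text to missing_commands.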