Namazu-users-en(old)
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
wwwoffle-mknmz-lasttime just adds, not replaces
- From: Dan Jacobson <jidanni@xxxxxxxxxxx>
- Date: Fri, 23 Apr 2004 09:22:26 +0800
- X-ml-name: namazu-users-en
- X-mail-count: 00497
I notice that namazu indexing lasttime does not replace the record of
older version of the same page, but just adds a record:
1. Amateur Radio Antenna Projects (score: 245)
Author: unknown
Date: Sat, 17 Apr 2004 06:05:37
AC6V's HOMEBREW ANTENNAS
http://ac6v.com/antprojects.htm (89,772 bytes)
3. Amateur Radio Antenna Projects (score: 110)
Author: unknown
Date: Wed, 04 Apr 2001 00:42:10
AC6V's HOMEBREW ANTENNAS
http://ac6v.com/antprojects.htm (8,213 bytes)
hmmm, in /var/cache/wwwoffle/search/namazu/scripts/wwwoffle-mknmz-lasttime
I already added
-Y, --no-delete do not detect removed documents.
which I know means don't wipe out the database.
I suppose
-Z, --no-update do not detect update and deleted documents.
means also don't update records of URLs already in the database, so I
won't try it. Sure wish they were explained further.
Ok, now trying gcnmz. [2 hours later:] Nope, duplicates still there:
$ namazu rhombic\ cebik /var/cache/wwwoffle/search/namazu/db/|grep ^http|sort
http://ac6v.com/antprojects.htm (8,213 bytes)
http://ac6v.com/antprojects.htm (89,772 bytes)
[Wonder what happens when gcnmz is running and one runs wwwoffle-mknmz-*]
By the way, if a page doesn't have a <TITLE>, search results use the
wwwoffle hash -- ugly. Better would be the URL or its last part.
2. DMVQBFUUoipjJTO4Fudy1NA (score: 348)
Author: unknown
Date: Thu, 08 Jan 2004 00:08:26
Element 4 Extra Class Question...
http://www.aa9pw.com/fcc/amateur/pools/2002_Extra_Pool3_.txt (65,440 bytes)