Namazu-users-en(old)



wwwoffle-mknmz-lasttime just adds, not replaces



I notice that indexing with wwwoffle-mknmz-lasttime does not replace the
record of an older version of the same page, but just adds another record:
1. Amateur Radio Antenna Projects (score: 245)
    Author: unknown
    Date: Sat, 17 Apr 2004 06:05:37
    AC6V's HOMEBREW ANTENNAS
    http://ac6v.com/antprojects.htm (89,772 bytes)

3. Amateur Radio Antenna Projects (score: 110)
    Author: unknown
    Date: Wed, 04 Apr 2001 00:42:10
    AC6V's HOMEBREW ANTENNAS
    http://ac6v.com/antprojects.htm (8,213 bytes)

Hmm, in /var/cache/wwwoffle/search/namazu/scripts/wwwoffle-mknmz-lasttime
I had already added the option
      -Y, --no-delete          do not detect removed documents.
which I know means don't wipe out the database.
I suppose
      -Z, --no-update          do not detect update and deleted documents.
also means don't update records of URLs already in the database, so I
won't try it.  I sure wish these options were explained further.

OK, now trying gcnmz. [2 hours later:] Nope, the duplicates are still there:
$ namazu rhombic\ cebik /var/cache/wwwoffle/search/namazu/db/|grep ^http|sort
http://ac6v.com/antprojects.htm (8,213 bytes)
http://ac6v.com/antprojects.htm (89,772 bytes)
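To list only the URLs that occur more than once, the same pipeline can be extended with uniq -d. This is just a sketch: dup_urls is a made-up helper name (not part of namazu or wwwoffle), and it assumes the result lines look like the two above, i.e. the URL followed by a space and the size in parentheses.

```shell
# dup_urls: read namazu result text on stdin and print each URL that
# appears more than once. Hypothetical helper, for illustration only.
dup_urls() {
    grep '^http' |        # keep only the URL lines of the results
    awk '{print $1}' |    # drop the trailing "(N bytes)" part
    sort | uniq -d        # print each URL seen at least twice, once
}
```

Usage would be something like: namazu rhombic\ cebik /var/cache/wwwoffle/search/namazu/db/ | dup_urls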

[I wonder what happens if one runs wwwoffle-mknmz-* while gcnmz is running.]

By the way, if a page doesn't have a <TITLE>, the search results show the
wwwoffle hash as the title, which is ugly.  The URL, or its last part,
would be better:

2. DMVQBFUUoipjJTO4Fudy1NA (score: 348)
    Author: unknown
    Date: Thu, 08 Jan 2004 00:08:26
    Element 4 Extra Class Question...
    http://www.aa9pw.com/fcc/amateur/pools/2002_Extra_Pool3_.txt (65,440 bytes)
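
The "last part of the URL" fallback could look roughly like the following. This is only a sketch of the idea: url_title is a hypothetical name, and nothing here is taken from the actual wwwoffle or namazu scripts.

```shell
# url_title: print the last path component of a URL, or the whole URL
# when it ends in "/". Hypothetical helper, for illustration only.
url_title() {
    url=$1
    last=${url##*/}          # strip everything up to the final "/"
    if [ -n "$last" ]; then
        printf '%s\n' "$last"
    else
        printf '%s\n' "$url"  # nothing after the last "/": keep the URL
    fi
}
```

For the result above, url_title http://www.aa9pw.com/fcc/amateur/pools/2002_Extra_Pool3_.txt would give 2002_Extra_Pool3_.txt instead of the hash.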