Full text search only understands whitespace as a word separator

Bug #29227 reported by Matt Zimmerman
This bug affects 7 people
Affects: Launchpad itself
Status: Triaged
Importance: Low
Assigned to: Unassigned

Bug Description

Searching for "pmu" doesn't find "/dev/pmu"

https://launchpad.net/distros/ubuntu/+bugs?field.searchtext=%2Fdev%2Fpmu&search=Search&orderby=-priority%2C-severity

does not find bug #28855, entitled "Gnome complains about wrong permissions for /dev/pmu on login"

See the duplicates report for other examples.

Dafydd Harries (daf)
Changed in launchpad:
assignee: nobody → stub
Sebastien Bacher (seb128) wrote :

It doesn't find "Bug #29776 in control-center (Ubuntu): "/dev/pmu permissions wrong"" either

Matthew Paul Thomas (mpt) wrote :

In the long term this is a duplicate of bug 2795, though this particular example could be made to work by declaring "/" to be a word separator.

Stuart Bishop (stub) wrote :

We can't fix bug 2795 in the short term without introducing scalability problems.

mpt - should I go ahead and make '/' a word separator? Any other punctuation? I don't think "-", "." or "'" should be. @ could be argued either way. &, | and ! are boolean operators.
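
A minimal sketch of the separator policy proposed in this comment, assuming whitespace and "/" split words while "-", "." and "'" stay inside them. The name `split_words` is hypothetical and this is not Launchpad's actual tokenizer; "@" and the boolean operators are left out because the thread does not settle how they should be handled.

```python
import re

# Assumption: only whitespace and "/" act as word separators.
# "-", "." and "'" are deliberately NOT in this class.
SEPARATORS = re.compile(r"[\s/]+")


def split_words(text):
    """Split text into words on runs of whitespace and '/'."""
    # Leading or trailing separators produce empty strings; drop them.
    return [word for word in SEPARATORS.split(text) if word]


title = "Gnome complains about wrong permissions for /dev/pmu on login"
print(split_words(title))
print(split_words("menu.lst mantis-275"))
```

With this split, "/dev/pmu" yields the words "dev" and "pmu", while "menu.lst" and "mantis-275" remain single words.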

Changed in malone:
assignee: stub → mpt
status: Unconfirmed → Confirmed
Matthew Paul Thomas (mpt) wrote :

Yes, as the MaloneSearch spec is now, a bug mentioning /dev/pmu should be returned in a search for "pmu", and in a search for "dev pmu". Is that reasonable?

Changed in malone:
assignee: mpt → stub
Matt Zimmerman (mdz) wrote :

Having '/' as a word separator sounds entirely reasonable, sane and consistent with other applications. Any reason not to do this?

What are the existing word separators?

Stuart Bishop (stub) wrote :

The existing word separators were just whitespace.

The first half of this has been implemented, based on the directions set out in the https://launchpad.net/products/malone/+spec/search specification. Queries from users are now split into components the way we want and sent to the text searching engine. We still need to update the code that builds the indexes so that these updated searches actually match something, as '/dev/pmu' is still stored as just '/dev/pmu' in the indexes rather than 'dev pmu devpmu'.
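
As a rough illustration of the index-side half that is still missing, the sketch below expands one whitespace-delimited token into the form described above ('dev pmu devpmu'). The function name `index_terms` and its `separators` parameter are hypothetical; the real index-building code is not shown in this report.

```python
import re


def index_terms(token, separators="/"):
    """Expand a token into its components plus their concatenation."""
    pattern = "[" + re.escape(separators) + "]+"
    parts = [part for part in re.split(pattern, token) if part]
    if len(parts) <= 1:
        # Nothing was split, so there is no joined form to add.
        return parts
    return parts + ["".join(parts)]


print(" ".join(index_terms("/dev/pmu")))  # dev pmu devpmu
```

Storing the joined form alongside the components is what would let a search for "pmu", "dev pmu" or "devpmu" reach the same bug.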

Changed in malone:
status: Confirmed → In Progress
Brad Bollenbach (bradb) wrote : Re: [Bug 29227] Searching for "pmu" doesn't find "/dev/pmu"

On 12-Feb-06, at 8:19 PM, Stuart Bishop wrote:

> Public bug report changed:
> https://launchpad.net/malone/bugs/29227
>
> Changed in: Malone (upstream)
> Status: Confirmed => In Progress

This doesn't appear to be working yet; any news on this Stuart?

Cheers,

--
Brad Bollenbach

Stuart Bishop (stub) wrote : Re: Searching for "pmu" doesn't find "/dev/pmu"

I haven't looked at this since my comment on Feb 13th.

Stuart Bishop (stub)
Changed in malone:
importance: Critical → High
Christian Reis (kiko) wrote :
Henrique Ferreiro (henrique-ferreiro) wrote :

There is another problem when searching for words with a dot in them:

Searching for "menu.lst" finds bug 61108 but not bug 78901.
Both of them have the word menu.lst in their subject and description.

Christian Reis (kiko) wrote :

This has gone unnoticed for too long, going Critical to get it on the bug report report.

Changed in malone:
importance: High → Critical
Stuart Bishop (stub) wrote :

It isn't a critical bug, nor one suited to a quick fix. It is best left until we are running PostgreSQL 8.2, which will help a lot with our outstanding searching issues.

Changed in launchpad:
importance: Critical → High
Stuart Bishop (stub)
Changed in launchpad:
status: In Progress → Confirmed
description: updated
Björn Tillenius (bjornt) wrote : Re: [Bug 29227] Re: Full text search doesn't support substring match

On Tue, Feb 12, 2008 at 06:57:30PM -0000, Francis J. Lacoste wrote:
> ** Summary changed:
>
> - Searching for "pmu" doesn't find "/dev/pmu"
> + Full text search doesn't support substring match

Why couldn't we change the word separators, so that /dev/pmu gets treated
as two words, 'dev' and 'pmu'?

Jean Jordaan (jean-jordaan) wrote :

I've just filed another duplicate bug. Three years and counting to add sane word separators? I recommend replacing the search function with Google site search in the meantime to at least get functioning search ;-)

Andrew Johnson (anj) wrote :

If there is still some reason why you can't change the indexer's list of word separators yet, can you not at least temporarily modify the word separators used to analyze the query, so that a query for "/dev/pmu" will find bugs containing " /dev/pmu " (or in my case a query for "mantis-275" will find a bug containing the string " mantis-275 ")? Currently search is totally broken and almost useless, since even an exact-text query for a whitespace-delimited word containing hyphens, dots or slashes finds nothing.
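
The mismatch described here can be reproduced with a toy model. This is only a sketch under stated assumptions: the index holds whitespace-delimited tokens, a query matches when all of its tokens are present, and the helper names `whitespace_tokens`, `slash_tokens` and `matches` are hypothetical. It does not model Launchpad's real search engine.

```python
import re


def whitespace_tokens(text):
    """Tokens as the index currently stores them (whitespace only)."""
    return set(text.split())


def slash_tokens(text):
    """Tokens when '/' is also treated as a word separator."""
    return {word for word in re.split(r"[\s/]+", text) if word}


def matches(query_tokens, indexed_tokens):
    """Assumed AND semantics: every query token must be indexed."""
    return bool(query_tokens) and query_tokens <= indexed_tokens


title = "Gnome complains about wrong permissions for /dev/pmu on login"
indexed = whitespace_tokens(title)

# Query split on '/' against a whitespace-only index: no match.
print(matches(slash_tokens("/dev/pmu"), indexed))
# Query analyzed with the same separators as the index: match.
print(matches(whitespace_tokens("/dev/pmu"), indexed))
```

In this model the search only works when the query and the index are tokenized with the same separators, which is the temporary alignment the comment asks for.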

Robert Collins (lifeless) wrote :

Unassigning to reflect reality ;)

Changed in launchpad-foundations:
assignee: Stuart Bishop (stub) → nobody
Curtis Hovey (sinzui)
Changed in launchpad:
importance: High → Low
William Grant (wgrant)
tags: added: bug-search