attachments - indexed content of pdf not very good
Bug #758717 reported by Ferdinand
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Odoo Addons (MOVED TO GITHUB) | Fix Released | Low | OpenERP R&D Addons Team 1 |
Bug Description
IMHO it's not an OpenERP problem, but a pdf2txt issue.
Nevertheless, OpenERP will be blamed if the content is not correctly indexed.
Maybe the page should say "experimental" somewhere.
Related branches
lp:~openerp-dev/openobject-addons/trunk-bug-758717-uco
- OpenERP Core Team: Pending requested
- Diff: 12 lines (+1/-1), 1 file modified: document/document_view.xml (+1/-1)
Changed in openobject-addons: | |
status: | Confirmed → In Progress |
Changed in openobject-addons: | |
status: | Fix Committed → Fix Released |
On Tuesday 12 April 2011, you wrote:
> Public bug reported:
>
> IMHO it's not an OpenERP problem, but a pdf2txt issue.
> nevertheless OpenERP will be blamed if the content is not correctly
> indexed. may be the page should say "experimental" somewhere
>
I always tell people that all content indexers *try* to extract the text of
the files. That doesn't mean they will always produce the same result a
human would get from reading the same files...
I'm afraid there is little we can do there.
However, if you look at the code, it is modular enough that anybody who can
provide better tools (say, a replacement for pdf2txt) can plug them in and
improve the indexing.
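
To illustrate the plug-in idea, here is a minimal sketch of a registry in which one text extractor can replace another for a given file type. It is not the actual OpenERP document module code: the names `Indexer`, `PdftotextIndexer` and `IndexerRegistry` are hypothetical, and the PDF indexer assumes poppler's `pdftotext` is installed and on the PATH.

```python
import subprocess


class Indexer:
    """Hypothetical base class: extracts plain text from one kind of file."""

    mime_types = ()  # MIME types this indexer claims to handle

    def index(self, path):
        raise NotImplementedError


class PdftotextIndexer(Indexer):
    """Shells out to poppler's pdftotext (assumed to be on the PATH)."""

    mime_types = ("application/pdf",)

    def index(self, path):
        # Passing "-" as the output file makes pdftotext write to stdout.
        result = subprocess.run(
            ["pdftotext", "-enc", "UTF-8", path, "-"],
            capture_output=True,
            check=True,
        )
        return result.stdout.decode("utf-8", errors="replace")


class IndexerRegistry:
    """Maps MIME types to indexers so that tools can be swapped out."""

    def __init__(self):
        self._by_mime = {}

    def register(self, indexer):
        # A later registration replaces an earlier one for the same type.
        for mime in indexer.mime_types:
            self._by_mime[mime] = indexer

    def index(self, path, mime):
        indexer = self._by_mime.get(mime)
        if indexer is None:
            return ""  # nothing registered for this type
        try:
            return indexer.index(path)
        except (OSError, subprocess.CalledProcessError):
            # Tool missing or extraction failed: store no indexed text.
            return ""
```

With something like this, a better PDF extractor only needs to subclass `Indexer`, declare `application/pdf` in `mime_types`, and be registered after the default one. Returning an empty string on failure matches the "indexers only *try*" caveat: a file that cannot be parsed is still stored, just without searchable content.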