Use XPath to parse LP-Pages
Bug #93499 reported by
Markus Korn
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
python-launchpad-bugs |
Fix Released
|
Wishlist
|
Markus Korn |
Bug Description
So far we are using Regular Expressions to parse the LP-Pages. These RegEx are mostly complicated and hard to maintain. The usage of XPath is more intuitive.
The attached patch against bughelper.main r118 provides a implementation of XPath. The Html-code of the LP-Pages is parsed by libxml2.
In some cases I was unable to replace the RegEx with a equivalent (simple) XPath-Construction.
This code needs to be tested and reviewed.
Also someone who is more familiar with XPath should review the statements and constructions i have chosen.
Markus
Changed in python-launchpad-bugs: | |
status: | Fix Committed → Fix Released |
To post a comment you must log in.
I pushed and updated and slightly modified patch to https:/ /code.launchpad .net/~bugsquad/ bughelper/ xpath - let's continue our work together in there.