Wikipedia Dump Reader

Registered 2008-05-17 by Benjamin Thyreau

An application to easily read Wikipedia's downloaded dump files.

This simple program displays the text-only compressed Wikipedia dumps, currently available at http://download.wikimedia.org/backup-index.html and typically named like pages-articles.xml.bz2.

It is now fairly usable for reading Wikipedia, although many rendering and layout glitches still occur.
It is focused on usability, and not necessarily trying to mimic the online web interface.

Features include a Qt viewer with basic text markup, link following, the ability to read directly from the .bz2 compressed file (although an index-creation step is needed on first run), a tab-like list of articles that load in the background by default, a simple but useful keyword search, very light source code, optional LaTeX rendering, and no installation required.
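To illustrate what reading directly from the .bz2 file and searching by keyword can look like, here is a minimal sketch using only the Python standard library. It is not the project's own code: the function names iter_pages and search_titles are hypothetical, and it scans the whole dump sequentially instead of building the index that the program creates on first run.

```python
import bz2
import xml.etree.ElementTree as ET


def iter_pages(dump_path):
    """Yield (title, wikitext) pairs from a bz2-compressed MediaWiki XML dump.

    Hypothetical helper for illustration; the real reader uses its own index.
    """
    with bz2.open(dump_path, "rb") as stream:
        title, text = None, ""
        for _event, elem in ET.iterparse(stream, events=("end",)):
            # Dump elements carry an XML namespace; keep only the local tag name.
            tag = elem.tag.rsplit("}", 1)[-1]
            if tag == "title":
                title = elem.text
            elif tag == "text":
                text = elem.text or ""
            elif tag == "page":
                yield title, text
                title, text = None, ""
                elem.clear()  # drop the finished page's children to limit memory use


def search_titles(dump_path, keyword):
    """Return the titles containing keyword (case-insensitive), in dump order."""
    keyword = keyword.lower()
    return [t for t, _ in iter_pages(dump_path) if t and keyword in t.lower()]
```

A call such as search_titles("pages-articles.xml.bz2", "python") would decompress and parse the whole file on every search, which is slow for a full dump. That cost is presumably why an index-creation step is worthwhile before interactive use.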

Project information

Maintainer:
Benjamin Thyreau
Driver:
Not yet selected
Development focus:

trunk series 

lp:wikipediadumpreader 

Programming Languages:
Python
Licences:
Simplified BSD Licence, GNU GPL v2
(The Qt-dependent code is required to be GPL; the rest of the code is BSD.)


Series and milestones

Wikipedia Dump Reader trunk series is the current focus of development

Get Involved

  • Report a bug
  • Ask a question
  • Help translate

Downloads

Latest version is 0.2.10
released on 2009-08-16


Announcements