This page reports errors found in the content and structure components of the weekly ODP data dumps. Each dump file is checked for illegal UTF-8 byte sequences, illegal XML characters, and XML well-formedness.
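The three checks named above can be illustrated with a minimal Python sketch. This is not the dumpcheck source: the function names (`count_utf8_errors`, `count_illegal_xml_chars`, `is_well_formed`, `check_dump`) are hypothetical, the well-formedness test uses Python's bundled expat parser rather than libxml2, and the way errors are counted is an assumption that may differ from what dumpcheck reports.

```python
import re
import xml.parsers.expat

# Complement of the XML 1.0 "Char" production:
# #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_ILLEGAL_XML_CHAR = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def count_utf8_errors(data: bytes) -> int:
    """Count invalid UTF-8 sequences by decoding strictly and resuming after each failure."""
    errors = 0
    pos = 0
    while pos < len(data):
        try:
            data[pos:].decode("utf-8")
            break  # the remainder decoded cleanly
        except UnicodeDecodeError as exc:
            errors += 1
            pos += exc.end  # exc.end is relative to the slice that was decoded
    return errors


def count_illegal_xml_chars(data: bytes) -> int:
    """Count decoded characters that fall outside the XML 1.0 Char production."""
    # Invalid byte sequences are dropped here; they are counted separately above.
    text = data.decode("utf-8", errors="ignore")
    return len(_ILLEGAL_XML_CHAR.findall(text))


def is_well_formed(data: bytes) -> bool:
    """Return True if expat parses the whole document without error."""
    parser = xml.parsers.expat.ParserCreate()
    try:
        parser.Parse(data, True)  # True marks this as the final chunk
        return True
    except xml.parsers.expat.ExpatError:
        return False


def check_dump(data: bytes) -> dict:
    """Run all three checks and collect the results (hypothetical report layout)."""
    return {
        "bytes_processed": len(data),
        "utf8_sequence_errors": count_utf8_errors(data),
        "xml_character_errors": count_illegal_xml_chars(data),
        "well_formed": is_well_formed(data),
    }


if __name__ == "__main__":
    sample = b'<?xml version="1.0" encoding="UTF-8"?><RDF><Topic>caf\xc3\xa9</Topic></RDF>'
    print(check_dump(sample))
```

The sketch loads the whole input into memory, which is impractical for dumps of the size reported on this page; a real checker would read the file in chunks and feed them to the parser incrementally.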
The ODP data exports used for this report may be downloaded from http://rdf.dmoz.org/rdf/
The source for the dumpcheck program may be downloaded from the ODP software page.
This page was moved from its original location on March 28, 2007 and is now hosted by editor jtaylorj.
For more information on known bugs and feature requests for ODP dumps, see my bug tracking page.
dumpcheck version 1.9 (libxml2 v2.6.16)
920953568 bytes processed
UTF-8 Sequence error(s): 0
XML Character encoding error(s): 0
W3C XML well-formedness: Passed
Detailed structure error listing

dumpcheck version 1.9 (libxml2 v2.6.16)
1833725547 bytes processed
UTF-8 Sequence error(s): 0
XML Character encoding error(s): 0
W3C XML well-formedness: Passed
Detailed content error listing