pixt converts files to text by making system calls. Because it is just a thin wrapper around a series of system calls, it is reasonably extensible: a pixt generator can be written for any file type. The example below illustrates a simple text conversion.

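import pixt

p = pixt.PiXt()                  # create the pixt object
p.settype('text')                # conversion type: the kind of file being read
data = p.convert('myfile.txt')   # returns the file's contents as a string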

The example above creates a pixt object, sets the conversion type (that is, the kind of item being read) to text, and then returns the file's data. The general concept is to read the file, perform the conversion, and return the resulting text; pixt only ever returns a string of text. This means that anything in a text file is read as-is and is in no way massaged. Eventually pixt might support the notion of formatters to turn that text into objects. Note that even a text file needs a conversion plugin (and has one).
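
To make the "wrapper around system calls" idea concrete, here is a minimal sketch of how such a converter could be structured. It is not pixt's actual implementation: the SketchConverter class and the COMMANDS table are hypothetical names invented for illustration, and the sketch assumes a Unix-like system where cat is available (the pdf entry additionally assumes the pdftotext tool from poppler is installed).

```python
import subprocess

# Hypothetical registry (not pixt's real internals): conversion type -> command.
# "{path}" is replaced with the file being converted.
COMMANDS = {
    "text": ["cat", "{path}"],             # plain text passes through unchanged
    "pdf": ["pdftotext", "{path}", "-"],   # assumes poppler's pdftotext is installed
}


class SketchConverter:
    """Toy stand-in for a pixt-like object; all names here are illustrative."""

    def __init__(self):
        self.kind = None

    def settype(self, kind):
        # Refuse types that have no registered command ("generator").
        if kind not in COMMANDS:
            raise ValueError(f"no generator registered for type {kind!r}")
        self.kind = kind

    def convert(self, path):
        if self.kind is None:
            raise RuntimeError("call settype() before convert()")
        # Build the argument list and run the external program.
        argv = [part.format(path=path) for part in COMMANDS[self.kind]]
        result = subprocess.run(argv, capture_output=True, text=True, check=True)
        # Whatever the program wrote to stdout is returned as a plain string.
        return result.stdout
```

Under these assumptions, supporting a new file type only means adding one entry to the table, and even the text type goes through an external command rather than being special-cased, which mirrors the point that a text file also has its own conversion plugin.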

Next: Writing A Generator
Download pixt