You only need the pdftotext program, get it with
Code: Select all
sudo apt-get install poppler-utils
Code: Select all
lesspipe The_Book.pdf
Code: Select all
lesspipe The_Book.pdf > The_Book.txt
Code: Select all
pdftotext The_Book.pdf
And lesspipe can also help you read the damned .gz files in /usr/share/doc/, like:
Code: Select all
lesspipe /usr/share/doc/mc/HACKING.gz
Here you have the choice: catdoc and antiword do the conversion from .doc to ASCII quite perfectly. You can even create a LaTeX file.
Code: Select all
catdoc -a Worksheet.doc #create an ASCII file
catdoc -l Worksheet.doc #create a LaTeX
antiword Worksheet.doc | less # create an ASCII and send it to less
Again, catdoc can do the conversion, this time to csv:
Code: Select all
xls2csv Spreadsheet1.xls