Various new bits of documentation on embedded files and text extraction