Deal with broken characters.
Make a better layout detector. Every character on its line.
Separate (more) merged characters.
Deal better with frames, lines, pictures, etc.
Change to ISO_8859-15 (update for ISO_8859-1 with euro sign).
Add an option for recognizing ISO_8859-9 chars (Turkish).
