When analyzing user-provided text, we are frequently interested only in meaningful words; punctuation, spaces, and conjunctions can easily get in our way. Suppose you want to count word frequencies in a book: you don't want "world" and "world," to be counted as two different words.
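To make the problem concrete, here is a minimal sketch of how attached punctuation can skew a word count. The `sample` sentence and the variable names are made up for illustration, and stripping with `string.punctuation` is only one simple way to merge the variants, not necessarily the approach this recipe takes.

```python
from collections import Counter
import string

# Hypothetical sample sentence, not taken from the recipe's text.
sample = "Hello world, said the world"

# Naive approach: split on whitespace only, so punctuation stays
# attached and "world," is counted separately from "world".
naive_counts = Counter(sample.lower().split())

# Assumed simple fix: strip leading/trailing ASCII punctuation from
# each token before counting, so both variants become "world".
clean_counts = Counter(
    word.strip(string.punctuation) for word in sample.lower().split()
)

print(naive_counts["world"], naive_counts["world,"])
print(clean_counts["world"])
```

With the naive split, the two spellings end up under different keys; once the punctuation is stripped they are counted together.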
Cleanup text
How to do it...
You have to perform the following steps:
- Supply the text you want to clean up:
txt = """And he looked over at the alarm clock,
ticking on the chest of drawers. "God in Heaven!" he thought.
It was half past six and the hands were quietly moving forwards,
it was even later than half past, more like quarter to seven.
Had the alarm clock not rung? He could...