The code, corpus, and regex examples from today’s class have been posted. (Clone the class GitHub repo if you want to do these things the right way.)
The repo also contains the htmlstripper.py program, which you can use or modify for whatever purposes you like.