Hi there! It's Malwina. I write programs to make things work.
I'm a biotechnologist who fell in love with robots and AI. I use Python to bring my ideas to life.
View My LinkedIn Profile
Simple text processing, part 2. First steps with regular expressions (regex) in Python.
Created as training project to practice regular expressions and process text without external natural language processing tools.
Text source: Lectures on The Language: An Introduction to the Study of Speech by Edward Sapir, 1921, www.gutenberg.org
Actions: extracting longest word&sentence, ten most frequent words, named entities
Input data
Input text: Lectures on The Language: An Introduction to the Study of Speech by Edward Sapir, 1921 (Sapir1921_chapter1.txt)
Stopwords list (stopwordlist.txt) to be removed from input text
Project is finished. It was done to excercise regex in Python.
To see the code of funny games, click here