Logo

Hi there! It's Malwina. I write programs to make things work.

I'm a biotechnologist who fell in love with robots and AI. I use Python to bring my ideas to life.

View My LinkedIn Profile

View My GitHub Profile

text processing with regex

Simple text processing, part 2. First steps with regular expressions (regex) in Python.

text

Table of contents

General info

Created as training project to practice regular expressions and process text without external natural language processing tools.

Text source: Lectures on The Language: An Introduction to the Study of Speech by Edward Sapir, 1921, www.gutenberg.org

Actions: extracting longest word&sentence, ten most frequent words, named entities

Technologies

Setup

Input data

Input text: Lectures on The Language: An Introduction to the Study of Speech by Edward Sapir, 1921 (Sapir1921_chapter1.txt)

Stopwords list (stopwordlist.txt) to be removed from input text

Status

Project is finished. It was done to excercise regex in Python.

Get the code

To see the code of funny games, click here