Project Review: Text Analysis
Counting Words and Letters Found Within a Text String
Features
Objectives
To use Python to calculate the number of instances of a given word and letter within a text string.
This was one of my earlier scripting projects that I completed before I ever started to build a Django portfolio in earnest. I added it to my Django portfolio at a later date and built upon it with additional polish features both from a refactoring of back end code perspective and improved presentation of the tables on the front-end.
The Approach & Solution
I started by thinking about the algorithm and approach to calculate the frequency of words and letters within the text block. It seemed reasonable to me that I would need to split the string into two operational lists.
The first would be a list of words that could be looped over and counted for their frequency of occurrence.
The second would be a list of letters that could be looped over and counted for their frequency of occurrence.
The next part would be to sort the lists in a logical order so that they could be rendered usefully to the user.
For words, it made sense that the list should be sorted by frequency of occurrence. For letters, it made sense that the list should be sorted alphabetically.
Evaluation
Following a refactor of the project, the code is now easier to understand and maintain.
The code has been extracted into a 'views.py' file where functions relating to the rendering of Django views lives.
The remainder of the code are utility functions that have been moved into a separate module called 'utils.py'.