Submitted by
Style Pass
2024-04-29 12:30:08

I am delighted to announce the release of my first Python package, englishidioms. This package is a powerful tool for identifying English idioms, phrases, and phrasal verbs within sentences, with a database of 22,209 unique expressions.

Background: While working on a personal Natural Language Processing (NLP) project, I needed a Python package that could recognize idiomatic expressions within English sentences. To my surprise, I couldn't find a suitable one, so I decided to build it from scratch.

This package was developed by parsing all dictionary entries in the "McGraw-Hill Dictionary of American Idioms and Phrasal Verbs." Using Python libraries such as python-docx, and relying on the predefined formatting of the entries, I extracted key points from each entry. These key points include:

Constants: Fixed classes of words that form the core of the idiomatic expression.
Variables: Variable classes of words that can be broadly matched.
Optional Constants: Additional optional words or phrases within a single idiomatic expression.
Verbs: Verbs that usually precede the idiomatic phrase.
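To make the four key points more concrete, here is a minimal sketch of how one parsed entry might be represented and matched against a sentence. This is not the englishidioms API or its actual algorithm: the `IdiomEntry` class, the `matches` function, the `max_gap` parameter, and the example entry are all hypothetical names chosen for illustration. The sketch assumes that constants must appear in order, that variables are allowed to fill short gaps between them, and that no lemmatization is done (so "pulled" would not match "pull").

```python
import re
from dataclasses import dataclass, field


@dataclass
class IdiomEntry:
    """Hypothetical container for the key points of one dictionary entry."""
    constants: list[str]                                         # fixed core words, in order
    variables: list[str] = field(default_factory=list)           # placeholders, e.g. "someone's"
    optional_constants: list[str] = field(default_factory=list)  # words that may be absent
    verbs: list[str] = field(default_factory=list)               # verbs that usually precede the phrase


def tokenize(sentence: str) -> list[str]:
    """Lowercase the sentence and keep runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", sentence.lower())


def matches(entry: IdiomEntry, sentence: str, max_gap: int = 3) -> bool:
    """Return True if the constants occur in order, with at most
    `max_gap` other tokens between two consecutive constants.

    Assumption: the gap stands in for variable and optional words.
    Verbs are ignored here because they only *usually* precede the phrase.
    """
    if not entry.constants:
        return False
    tokens = tokenize(sentence)
    for start, token in enumerate(tokens):
        if token != entry.constants[0]:
            continue
        pos = start
        for const in entry.constants[1:]:
            # Look for the next constant within the allowed window.
            window = tokens[pos + 1 : pos + 2 + max_gap]
            if const not in window:
                break
            pos = pos + 1 + window.index(const)
        else:
            return True  # every constant was found in order
    return False


# Illustrative entry for "pull someone's leg".
entry = IdiomEntry(constants=["pull", "leg"], variables=["someone's"])
print(matches(entry, "Don't pull my leg!"))  # True
```

Keeping constants and variables separate is what lets a single entry cover "pull my leg", "pull her leg", and so on, without listing every variant. A real matcher would likely also need lemmatization so that inflected forms are recognized.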
