Do More Polarizing YouTube Titles Get More Views?

Accompanying YouTube Video: We’re all familiar with the concept of clickbait now, so I was curious if more polarizing YouTube titles got more views. Just like the last couple of articles involving YouTube, we’ll be using Selenium, BeautifulSoup, and The Text API. Just like when we found The Most Common Phrases on the Front PageContinue reading “Do More Polarizing YouTube Titles Get More Views?”

What Are The Most Common Phrases on YouTube’s Front Page?

Accompanying YouTube video: Have you ever wondered how to make the front page of YouTube? I certainly have, so I used the best sentiment analysis API out there, The Text API, in combination with Selenium and Beautiful Soup to find out what the most common phrases in the titles on YouTube’s front page are. We’reContinue reading “What Are The Most Common Phrases on YouTube’s Front Page?”

The Best Way to do Named Entity Recognition (NER)

Named Entity Recognition (NER) is a common Natural Language Processing technique. It’s so often used that it comes in the basic pipeline for spaCy. NER can help us quickly parse out a document for all the named entities of many different types. For example, if we’re reading an article, we can use named entity recognitionContinue reading “The Best Way to do Named Entity Recognition (NER)”

Natural Language Processing: Part of Speech Tagging

Part of Speech (POS) Tagging is an integral part of Natural Language Processing (NLP). The first step in most state of the art NLP pipelines is tokenization. Tokenization is the separating of text into “tokens”. Tokens are generally regarded as individual pieces of languages – words, whitespace, and punctuation. Once we tokenize our text weContinue reading “Natural Language Processing: Part of Speech Tagging”