Text Analytics 1: Introduction to Natural Language Processing
About this Course
Introducing Natural Language Processing is part one of the Text Analytics with Python professional certificate (or you can study it as a stand-alone course). This first course introduces the core techniques of natural language processing (NLP) and computational linguistics. But we introduce these techniques from data science alongside the cognitive science that makes them possible. How can we make sense out of the incredible amount of knowledge that has been stored as text data? This course is a practical and scientific introduction to natural language processing. That means you’ll learn how it works and why it works at the same time. On the practical side, you’ll learn how to actually do an analysis in Python: creating pipelines for text classification and text similarity that use machine learning. These pipelines are automated workflows that go all the way from data collection to visualization. You’ll learn to use Python packages like pandas, scikit-learn, and tensorflow. On the scientific side, you’ll learn what it means to understand language computationally. Artificial intelligence and humans don’t view documents in the same way. Sometimes AI sees patterns that are invisible to us. But other times AI can miss the obvious. We have to understand the limits of a computational approach to language and the ethical guidelines for applying it to real-world problems. For example, we can identify individuals from their tweets. But we could never predict future criminal behaviour using social media. This course will cover topics you may have heard of, like text processing, text mining, sentiment analysis, and topic modeling.Created by: University of Canterbury
Level: Introductory

Related Online Courses
The course introduces the basic elements and trends for performing a through built environment assessment and develops the tools for urban sustainability policy impact analysis. During the first... more
Most data science projects fail. There are various reasons why, but one of the primary reasons is the challenge of deployment. One piece to the deployment puzzle is understanding how to automate... more
Apache® Spark™ is a fast, flexible, and developer-friendly open-source platform for large-scale SQL, batch processing, stream processing, and machine learning. Users can take advantage of its op... more
This course introduces the concept of a designer working for themselves and creating their own design-driven brand. It requires learners to apply ideas to a real-life setting. By the end of this... more
This course is the first of a two-course sequence: Introduction to Computer Science and Programming Using Python, and Introduction to Computational Thinking and Data Science. Together, they are... more