Python, a multi-paradigm programming language, has become the language of choice for data scientists for data analysis, visualization, and machine learning.
In this course, you’ll start by learning how to acquire data from the web in its already “clean” format, such as in a .csv file, or a database. You’ll then learn to transform this data so it’s in its most useful format for analysis. After that, you’ll dive into data aggregation and grouping, where you’ll learn to group similar data for easier analysis purposes. From there, you’ll be shown different methods of web scraping using Python. Finally, you’ll learn to extract large amounts of data using BeautifulSoup, as well as work with Selenium and Scrapy.
About the author
Curtis Miller is Associate Instructor at the University of Utah, and an MSTAT student. He is currently involved in research on data analysis from statistical and computer science perspectives. Curtis has published research on policy and economic issues.