Chris Myers (CAC), Jeff Sale (SDSC)
Cornell Center for Advanced Computing and San Diego Supercomputing Center

Revisions: 6/2023, 2/2020 (original)

Data visualization is a key component in the data science process, from the initial stages of data exploration to the end game of communicating complicated results to one's audience. Making sense of patterns in data is often facilitated by human cognitive capabilities of pattern recognition, and graphical summaries of a large and complex dataset can often suggest particular directions for further quantitative analysis. There is of course an old adage that "seeing is believing", but in some cases wth complex datasets, it is sometimes the case that "seeing is understanding", in that useful data visualizations can help us to understand the data in its entirety (at least in broad strokes). In this topic, we introduce various techniques for visualizing different types of data and describe tools in the Python data science ecosystem to support those activities.

Objectives

After you complete this segment, you should be able to:

  • Understand different types of visualization techniques and data representation methods
  • Use the Matplotlib, Seaborn, and Bokeh libraries for data visualization
  • Integrate visualization packages with other forms of data processing in Python to craft custom data visualizations
  • Understand some of the differences between static and interactive data visualizations
Prerequisites

This tutorial assumes the reader has some working knowledge of general programming concepts, even if not directly with the Python programming language. The target audience is scientists and engineers who are already programming in Python, and are interested in using Python tools and packages to carry out various analyses of datasets. If additional introductory material about Python is needed, readers can consult An Introduction to Python as well as the documentation on the python.org website.

 
©  |   Cornell University    |   Center for Advanced Computing    |   Copyright Statement    |   Inclusivity Statement