Visualize to Realize your Data

If you have experienced the perks of being a Data Analyst or Machine Learning enthusiast, you probably love exploring data and the story behind everything you code!

As a Data Scientist, you are not just a computer engineer or programmer; you are an all-rounder! It may sound like an exaggeration, but it is a fact. In this field, you are exploring a changing world!

Analysis is the need of the hour, but it is just as important to present what the data conveys in a form people can take in. Not everyone is acquainted with data and its analysis, and this is where the famous saying “A picture conveys a thousand words” comes into the picture.

Today, let us dive into some important aspects of Data Visualization.
Since grade 1, we have been studying Data Handling! Of course, it used to be near the top of my list of favorite topics, as Data Handling was just too easy :D
But now that I actually practice Data Handling, I realize how crucial each Data Visualization tool is!

Let us now look at different representations of data and when to use each.

Line Plot

Line charts are best for showing trends over a period of time, and multiple lines can be used to show trends in more than one group.

Bar Plot

Bar charts are useful for comparing quantities corresponding to different groups.


Heatmap

Heatmaps can be used to find color-coded patterns in tables of numbers.

Scatter plots

Scatter plots show the relationship between two continuous variables; if the points are color-coded, they can also show the relationship with a third, categorical variable.

Regression line

Including a regression line in the scatter plot makes it easier to see any linear relationship between two variables.
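To see what that line actually is, here is a minimal sketch that fits an ordinary least-squares line using only the Python standard library. The `fit_line` helper and the hours/scores numbers are made up for illustration; a plotting library would normally compute and draw this line for you.

```python
from statistics import mean

def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Covariance-like and variance-like sums (the 1/n factors cancel out).
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical data: hours studied vs. exam score.
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 68]

slope, intercept = fit_line(hours, scores)
print(f"score = {slope:.2f} * hours + {intercept:.2f}")
# prints: score = 4.10 * hours + 47.70
```

A positive slope like this one is exactly what the upward-tilting line on the scatter plot would be telling you.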

Swarm Plot

Categorical scatter plots show the relationship between a continuous variable and a categorical variable.


Histogram

Histograms show the distribution of a single numerical variable.
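Under the hood, a histogram is just counting how many values land in each bin. Here is a rough sketch in plain Python; the `histogram_counts` helper, the ages, and the bin width of 10 are all assumptions chosen for illustration.

```python
def histogram_counts(values, bin_width, start):
    """Count values per bin; each bin covers [left, left + bin_width)."""
    counts = {}
    for v in values:
        k = int((v - start) // bin_width)   # index of the bin v falls into
        left = start + k * bin_width        # left edge of that bin
        counts[left] = counts.get(left, 0) + 1
    return dict(sorted(counts.items()))     # empty bins are simply absent

# Hypothetical data: ages of ten people.
ages = [21, 22, 22, 25, 27, 31, 33, 34, 38, 45]
width = 10

for left, n in histogram_counts(ages, bin_width=width, start=20).items():
    print(f"{left}-{left + width - 1}: {'#' * n}")
# prints:
# 20-29: #####
# 30-39: ####
# 40-49: #
```

Each row of `#` marks plays the role of one bar, so you can already see that most of the people are in their twenties.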

KDE plots

KDE plots (or 2D KDE plots) show an estimated, smooth distribution of a single numerical variable (or two numerical variables).
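The "smooth" part comes from placing a small bell curve on every data point and averaging them. Below is a minimal sketch of a one-dimensional Gaussian KDE in plain Python. The `gaussian_kde` helper, the ages, and the bandwidth of 4.0 are assumptions for illustration; plotting libraries usually pick the bandwidth automatically.

```python
import math

def gaussian_kde(samples, bandwidth):
    """Return a function that estimates the density at x from the samples."""
    n = len(samples)
    # Normalizing constant so the estimated density integrates to 1.
    norm = 1.0 / (n * bandwidth * math.sqrt(2 * math.pi))

    def density(x):
        # Sum one Gaussian bump per sample, centered on that sample.
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density

# Hypothetical data: ages of ten people.
ages = [21, 22, 22, 25, 27, 31, 33, 34, 38, 45]
density = gaussian_kde(ages, bandwidth=4.0)

# Crude text rendering of the curve, one row per age.
for x in range(15, 51, 5):
    print(f"{x:2d} {'#' * round(density(x) * 400)}")
```

A larger bandwidth gives a smoother, flatter curve, while a smaller one follows the individual data points more closely.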

This is a basic overview of the different visualizations and the scenarios each one suits.




Data Science Aspirant

Praseeda Saripalle
