CoderzColumn : Data Science Tutorials (Page: 1)

Data Science Tutorials


Data science is an interdisciplinary field that uses scientific methods, procedures, algorithms, and systems to extract knowledge and insights from noisy, structured, and unstructured data, and applies them across a wide range of application fields.

The tutorials here cover data visualization libraries like matplotlib, Bokeh, bqplot, Plotnine, cufflinks, Altair, hvplot, Holoviews, seaborn, and more.

The tutorials also cover different interactive charts such as Sunburst Charts, Sankey Diagrams (Alluvial), Candlestick Charts, Network Charts, Chord Diagrams, Parallel Coordinates Plots, Radar Charts, Connection Maps, Treemaps, Choropleth Maps, and Scatter & Bubble Maps.

Apart from this, you will find tutorials about time series data and its applications, creating dashboards, and other concepts.

For an in-depth understanding of the above concepts, check out the sections below.

Recent Data Science Tutorials


Tags bqplot, interactive-plots
bqplot - Interactive Plotting in Python Jupyter Notebook
Data Science

A detailed guide on how to create interactive charts in Jupyter Notebook using the Python library bqplot. The tutorial explains various chart types like scatter charts, bar charts, line charts, heatmaps, and many more. All charts are created using the pyplot API of bqplot, which closely resembles the pyplot API of matplotlib. The tutorial is a good starting point for someone new to bqplot.

Sunny Solanki
Tags bokeh, data-visualization
Bokeh - Basic Interactive Plotting in Python [Jupyter Notebook]
Data Science

A detailed guide on how to use the Python data visualization library Bokeh to create interactive charts. The tutorial covers various chart types like bar charts, scatter charts, line charts, pie charts, heatmaps, box-and-whisker plots, and many more. The tutorial is a good starting point for someone who wants to learn Bokeh.

Sunny Solanki
Tags holoviews, basic-plots
Getting Started with Holoviews - Basic Interactive Plotting [Python]
Data Science

A detailed guide on how to use the Python data visualization library Holoviews to create interactive charts. Holoviews is built on top of bokeh, matplotlib, and plotly, and lets us create charts using any one of them as a backend. The tutorial covers various charts like scatter plots, bar charts, histograms, and violin plots. It also covers how to combine simple charts to create more complex ones.

Sunny Solanki
Tags bokeh, animation
Simple Guide to Create Animation using Bokeh
Data Science

A detailed guide on how to create animations in Python using the data visualization library Bokeh. The tutorial creates different types of chart animations like bar chart animation, candlestick chart animation, bubble chart animation, etc.

Sunny Solanki
Tags Bokeh, widgets, Apps, GUI
Simple Guide to use Bokeh Widgets (Interactive GUI / Apps)
Data Science

A simple guide on how to create interactive GUIs / apps with widgets using the Python data viz library Bokeh. The tutorial explains, with simple examples, how to use the widgets (dropdowns, radio buttons, checkboxes, date pickers, sliders, etc.) available from Bokeh. The Bokeh apps explained in the tutorial use Python callbacks to update charts.

Sunny Solanki
Tags plotnine-charts, annotations
Add Annotations to Plotnine Charts
Data Science

A simple guide on how to add annotations to plotnine charts with simple examples. Plotnine is a Python data viz library that lets us create static charts. The tutorial explains annotations like text labels, arrows, boxes, polygons, spans, slopes, etc.

Sunny Solanki
Tags Pandas-Bokeh, interactive-charts
Pandas-Bokeh: Create Bokeh Charts from Pandas DataFrame with One Line of Code
Data Science

A detailed guide to the Python data visualization library Pandas-Bokeh, which lets us create interactive Bokeh charts from a pandas DataFrame with just one simple function call. The tutorial covers, with simple examples, the various charts (scatter, bar, step, line, pie, histogram, scatter maps, bubble maps, etc.) available from the library.

Sunny Solanki
Tags bokeh-charts, annotations
How to Add Annotations to Bokeh Charts?
Data Science

A simple guide on how to add annotations to charts created using the Python data viz library Bokeh. The tutorial explains different annotations like arrows, labels, polygons, spans, ranges, bounds, boxes, etc. It also covers styling of these annotations with simple examples.

Sunny Solanki
Tags bokeh, ipywidgets, widgets
How to Link Bokeh Charts with IPywidgets widgets to Dynamically Update Charts?
Data Science

A simple guide on how to create an interactive GUI using the Python data viz library Bokeh and the widgets library ipywidgets. The tutorial explains how to link ipywidgets widgets (like dropdowns, checkboxes, sliders, etc.) with Bokeh charts so that the charts update dynamically when widget states change.

Sunny Solanki
Tags candlestick, mplfinance, plotly, bokeh, bqplot, c…
Candlestick Chart in Python (mplfinance, plotly, bokeh, bqplot & cufflinks)
Data Science

A simple guide to creating candlestick charts in Python using the data visualization libraries mplfinance (matplotlib), Plotly, Bokeh, bqplot, and Cufflinks. The tutorial covers a simple styling guide as well. Charts created using mplfinance are static, whereas charts created using the other libraries are interactive.

Sunny Solanki
Python Data Visualization Libraries



Data visualization is the graphical representation of information / data. It is one of the most efficient ways of communicating information to users, as humans are quite good at spotting patterns in visual data.

Python has many libraries that can help us create data visualizations. Some of these libraries (matplotlib, seaborn, plotnine, etc.) generate static charts, whereas others (bokeh, plotly, bqplot, altair, holoviews, cufflinks, hvplot, etc.) generate interactive charts. Most basic visualizations like bar charts, line charts, scatter plots, histograms, box plots, pie charts, etc. are supported by all of these libraries. Many libraries also support advanced visualizations, widgets, and dashboards.

Advanced Data Visualizations using Python



Basic data visualizations like bar charts, line charts, scatter plots, histograms, box plots, pie charts, etc. are quite good at representing information and exploring relationships between data variables.

But sometimes these visualizations are not enough, and we need to analyze data from different perspectives. For this purpose, many advanced visualizations have been developed over time, like Sankey diagrams, candlestick charts, network charts, chord diagrams, sunburst charts, radar charts, parallel coordinates charts, etc. Python has many data visualization libraries that let us create such advanced data visualizations.

Dashboards using Python



Dashboards are widely used across industries. A dashboard is a GUI that combines various visualizations and metrics so that key performance indicators can be monitored in one place. Dashboards have a very wide range of applications in all fields.

Python has many libraries (dash, panel, streamlit, bokeh, etc.) that let us create dashboards. They let us include widgets and interactive data visualizations in dashboards.

Work with Time Series Data in Python



A time series is a type of data where data points are recorded in time order or at specified time intervals. Many real-world datasets, like stock prices, weather indicators, heights of ocean tides, retail sales, etc., are time series.

Time series analysis involves various tasks like resampling time series, applying moving window functions, forecasting, classification, etc.

Python has various libraries (pandas, statsmodels, etc.) that let us load and work with time series data efficiently and that provide useful functionality for these tasks.
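As a minimal sketch of two of the tasks named above (resampling and moving window functions), the snippet below uses pandas. The dates and values are made up purely for illustration, and the variable names are hypothetical; it is not taken from any of the tutorials.

```python
import pandas as pd

# Hypothetical daily observations (values 1..14); the data is invented for illustration.
dates = pd.date_range("2024-01-01", periods=14, freq="D")
prices = pd.Series(range(1, 15), index=dates, dtype="float64", name="price")

# Resampling: aggregate the daily values into consecutive 7-day bins and take the mean.
weekly_mean = prices.resample("7D").mean()

# Moving window: 3-day rolling average (the first two entries are NaN
# because a full window of three values is not yet available).
rolling_mean = prices.rolling(window=3).mean()

print(weekly_mean)
print(rolling_mean.head())
```

Fixed 7-day bins are used here instead of calendar weeks so that the bin boundaries follow directly from the first date in the series; forecasting and classification would need additional libraries such as statsmodels.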

Visualize Maps using Python



Maps are one of the best ways to display and analyze geospatial data. They help us see geographic patterns and trends more clearly, which can lead to better decision-making.

Many different types of maps have been developed over time to analyze data from different perspectives. Some common map visualization types are choropleth maps, scatter maps, bubble maps, connection maps, etc. Apart from these, we can also include pins on maps to identify locations.

Python has many different libraries (geopandas, folium, ipyleaflet, cartopy, geoviews, geoplot, bokeh, altair, plotly, hvplot, etc.) that let us create static as well as interactive maps.

Exploratory Data Analysis using Python



Exploratory data analysis (commonly referred to as EDA) is an initial analysis of data to look for various relationships, anomalies, missing values, distributions, basic statistics, etc. It helps us understand data better to make further decisions. Various stats are calculated and statistical visualizations are created during EDA to understand data.

Python provides many different tools / libraries (Sweetviz, missingno, seaborn, pandas, etc.) for performing EDA. It's quite common to use more than one of these tools in a single analysis.
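As a rough sketch of the kind of first-pass checks described above (missing values, basic statistics, relationships, distributions), the snippet below uses only pandas. The small DataFrame and its column names are hypothetical and invented for illustration.

```python
import pandas as pd

# Hypothetical dataset with a few missing values; all values are invented.
df = pd.DataFrame({
    "age": [25, 32, None, 41, 29],
    "income": [40000, 52000, 61000, None, 45000],
    "city": ["Pune", "Delhi", "Pune", None, "Mumbai"],
})

# Missing values: number of missing entries per column.
missing_counts = df.isna().sum()

# Basic statistics (count, mean, std, quartiles) for the numeric columns.
summary = df.describe()

# Relationships: pairwise correlation between the numeric columns.
correlation = df[["age", "income"]].corr()

# Distribution of a categorical column (missing entries are excluded by default).
city_counts = df["city"].value_counts()

print(missing_counts)
print(summary)
print(correlation)
print(city_counts)
```

Tools such as Sweetviz or missingno build fuller reports and plots on top of checks like these; the pandas calls are shown here only because they need no extra dependencies.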