// the datavedas project

Data Science, Structured.

DataVedas is an ongoing effort to map the world of data science by bringing together concepts from mathematics, statistics, programming, machine learning, and AI in a structured and connected way.

Raw Data
a single observation

// why datavedas

A modern interpretation of an old idea.

Data science knowledge is spread across books, research papers, tutorials, documentation, and industry practice. Inspired by how Vyasa compiled the Vedas, DataVedas brings these ideas together into a connected structure that can be explored as a whole.

// from collection to connection
fragmented converging structured

// the path

Building Knowledge, One Layer at a Time

Each section combines theory with practical implementation in Python, R, and other tools. The sequence reflects how understanding develops in practice, with each section building upon concepts introduced before it.

// theory

Understand the "why".

Before learning how a technique is implemented, it helps to understand why it exists. DataVedas introduces concepts through intuition, examples, visual explanations, and mathematical reasoning before moving to code.

mean median
  • Mean → average value
  • Median → middle observation
  • Mode → most frequent value
  • Outlier shifts mean →

// theory, then application

Then run it.

Once the underlying ideas are clear, each concept is implemented using Python, R, and other tools through practical examples and runnable code.

descriptive_stats.py
# central tendency in three lines
import pandas as pd
df = pd.read_excel("diamonds.xls")

df["price"].mean()      # 3932.8
df["price"].median()    # 2401.0
df["price"].mode()[0]   # 605

// author's pick

Favourite reads

// where to begin

Start at the foundation.

DataVedas is designed to be explored in sequence, with each section building upon the last. If you're unsure where to begin, start at the foundation and follow the path forward.

Explore the star map
ESC
100 pages indexed · Esc to close