Data is everywhere. Most of it sits unused because nobody knows what to do with it. The difference between data and insight is analysis, and the difference between insight and action is how you present it.
This is where we talk about turning raw data into something useful—analysis techniques, visualization approaches, tools that work, patterns worth knowing about. Whether you're tracking users, analyzing markets, or just trying to understand what's happening in your product.
Why Data Science Matters for Builders
You don't need a PhD to use data effectively. You just need to ask the right questions and know how to find answers.
Product decisions backed by data beat opinions and assumptions. Usage patterns tell you what users actually do, not what they say they'll do. Metrics show what's working and what isn't.
Good visualization makes complex data understandable. A clear chart communicates in seconds what a spreadsheet takes minutes to parse.
Types of Data Analysis
Descriptive analytics: What happened? User counts, conversion rates, feature usage, revenue trends. The foundation—understanding current state.
Diagnostic analytics: Why did it happen? Correlation analysis, cohort comparisons, funnel analysis. Understanding causes behind the numbers.
Predictive analytics: What will happen? Trend forecasting, churn prediction, demand estimation. Using historical data to anticipate future.
Prescriptive analytics: What should we do? Optimization, recommendation engines, decision support. Actionable guidance from data.
Most builders need descriptive and diagnostic analytics. Start there before getting fancy.
Essential Data Skills
SQL for data querying
Most data lives in databases. SQL lets you ask questions of that data directly.
SQLBolt - Interactive SQL tutorial Mode SQL Tutorial - SQL for data analysis PostgreSQL Exercises - Practice SQL queries
Spreadsheet analysis
Google Sheets or Excel handle most analysis needs. Pivot tables, basic formulas, charts—surprisingly powerful.
Spreadsheet Formulas - Google Sheets function reference
Python for data work
When spreadsheets aren't enough, Python with pandas and numpy handles serious data manipulation.
Python for Data Analysis - Comprehensive guide by pandas creator Kaggle Learn - Free courses on Python, pandas, data viz
Statistical thinking
Understanding correlation vs causation, statistical significance, sampling bias. Not advanced math—just thinking clearly about what data means.
Statistics Done Wrong - Common statistical mistakes ***** Statistics by Charles Wheelan - Statistics without the math
Data Visualization Principles
Good visualization has clear purpose. Are you exploring data yourself or communicating findings to others? Different goals need different approaches.
Choose the right chart type
Line charts for trends over time Bar charts for comparing categories
Scatter plots for relationships between variables Pie charts almost never (seriously, bar charts work better) Heatmaps for multidimensional data
From Data to Viz - Decision tree for choosing chart types Financial Times Visual Vocabulary - Chart selection guide
Design for clarity
Remove unnecessary elements. Every pixel should serve purpose—show data, label it clearly, or get out of the way.
Use color purposefully. Highlight what matters, use consistent color schemes, consider colorblind accessibility.
Make text readable. Clear labels, readable fonts, appropriate sizes. If viewers squint, you failed.
Edward Tufte's principles - The visual display of quantitative information
Tools for Data Visualization
For builders and analysts:
Tableau Public - Free powerful visualization tool Google Data Studio / Looker Studio - Free dashboards connected to data sources Metabase - Open source analytics and dashboarding Observable - Interactive data visualization with D3.js
For developers:
D3.js - Powerful JavaScript visualization library Plotly - Interactive charts in Python, R, JavaScript Chart.js - Simple HTML5 charts Recharts - React charting library
For quick analysis:
Google Sheets - Built-in charting Excel - Advanced charting capabilities Datawrapper - Quick beautiful charts
Common Data Analysis Patterns
Funnel analysis
Track users through conversion steps. Where do they drop off? What percentage completes each stage?
Critical for: Signup flows, checkout processes, onboarding sequences
Mixpanel Funnels - Dedicated funnel analytics PostHog Funnels - Open source funnel tracking
Cohort analysis
Group users by when they signed up or first took action. Compare retention, engagement, revenue across cohorts.
Reveals: Whether product is improving over time, which acquisition channels bring better users
Amplitude Cohorts - Guide to cohort analysis
A/B test analysis
Compare two versions to see which performs better. Requires understanding statistical significance.
Evan Miller A/B Tools - Sample size and significance calculators Optimizely Stats Engine - How A/B testing stats work
Time series analysis
Analyze data over time. Identify trends, seasonality, anomalies. Forecast future based on historical patterns.
Prophet by Facebook - Time series forecasting tool ARIMA models - Statistical forecasting method
Data Ethics and Privacy
Working with data comes with responsibility. User data isn't yours—it's entrusted to you.
Collect only what you need: Don't gather data "just in case." Have specific purpose for every data point collected.
Anonymize appropriately: Remove personally identifiable information when possible. Aggregate for analysis.
Secure data properly: Encrypt sensitive data, limit access, follow security best practices.
Be transparent: Users should know what data you collect and why. Privacy policies matter.
Respect regulations: GDPR, CCPA, and other privacy laws have teeth. Compliance isn't optional.
GDPR Compliance Checklist - European privacy regulation CCPA Compliance Guide - California privacy law
Getting Started with Data Analysis
Start with questions, not data. What do you want to know? What decisions would this inform?
Use existing tools before building custom solutions. Google Analytics, Plausible, Mixpanel handle most needs.
Begin with simple analysis. Basic metrics and trends teach you more than complex models applied prematurely.
Document your findings. Analysis is worthless if insights get lost. Share what you learn.
Resources for Learning Data Science
Kaggle - Datasets, competitions, tutorials, community DataCamp - Interactive data science courses Coursera Data Science Specialization - Johns Hopkins comprehensive program r/datascience - Community and resources
Data science isn't magic. It's asking good questions, finding relevant data, analyzing it honestly, and communicating findings clearly. Master those basics and you'll make better decisions than 90% of people building products.
This is where we talk about turning raw data into something useful—analysis techniques, visualization approaches, tools that work, patterns worth knowing about. Whether you're tracking users, analyzing markets, or just trying to understand what's happening in your product.
Why Data Science Matters for Builders
You don't need a PhD to use data effectively. You just need to ask the right questions and know how to find answers.
Product decisions backed by data beat opinions and assumptions. Usage patterns tell you what users actually do, not what they say they'll do. Metrics show what's working and what isn't.
Good visualization makes complex data understandable. A clear chart communicates in seconds what a spreadsheet takes minutes to parse.
Types of Data Analysis
Descriptive analytics: What happened? User counts, conversion rates, feature usage, revenue trends. The foundation—understanding current state.
Diagnostic analytics: Why did it happen? Correlation analysis, cohort comparisons, funnel analysis. Understanding causes behind the numbers.
Predictive analytics: What will happen? Trend forecasting, churn prediction, demand estimation. Using historical data to anticipate future.
Prescriptive analytics: What should we do? Optimization, recommendation engines, decision support. Actionable guidance from data.
Most builders need descriptive and diagnostic analytics. Start there before getting fancy.
Essential Data Skills
SQL for data querying
Most data lives in databases. SQL lets you ask questions of that data directly.
SQLBolt - Interactive SQL tutorial Mode SQL Tutorial - SQL for data analysis PostgreSQL Exercises - Practice SQL queries
Spreadsheet analysis
Google Sheets or Excel handle most analysis needs. Pivot tables, basic formulas, charts—surprisingly powerful.
Spreadsheet Formulas - Google Sheets function reference
Python for data work
When spreadsheets aren't enough, Python with pandas and numpy handles serious data manipulation.
Python for Data Analysis - Comprehensive guide by pandas creator Kaggle Learn - Free courses on Python, pandas, data viz
Statistical thinking
Understanding correlation vs causation, statistical significance, sampling bias. Not advanced math—just thinking clearly about what data means.
Statistics Done Wrong - Common statistical mistakes ***** Statistics by Charles Wheelan - Statistics without the math
Data Visualization Principles
Good visualization has clear purpose. Are you exploring data yourself or communicating findings to others? Different goals need different approaches.
Choose the right chart type
Line charts for trends over time Bar charts for comparing categories
Scatter plots for relationships between variables Pie charts almost never (seriously, bar charts work better) Heatmaps for multidimensional data
From Data to Viz - Decision tree for choosing chart types Financial Times Visual Vocabulary - Chart selection guide
Design for clarity
Remove unnecessary elements. Every pixel should serve purpose—show data, label it clearly, or get out of the way.
Use color purposefully. Highlight what matters, use consistent color schemes, consider colorblind accessibility.
Make text readable. Clear labels, readable fonts, appropriate sizes. If viewers squint, you failed.
Edward Tufte's principles - The visual display of quantitative information
Tools for Data Visualization
For builders and analysts:
Tableau Public - Free powerful visualization tool Google Data Studio / Looker Studio - Free dashboards connected to data sources Metabase - Open source analytics and dashboarding Observable - Interactive data visualization with D3.js
For developers:
D3.js - Powerful JavaScript visualization library Plotly - Interactive charts in Python, R, JavaScript Chart.js - Simple HTML5 charts Recharts - React charting library
For quick analysis:
Google Sheets - Built-in charting Excel - Advanced charting capabilities Datawrapper - Quick beautiful charts
Common Data Analysis Patterns
Funnel analysis
Track users through conversion steps. Where do they drop off? What percentage completes each stage?
Critical for: Signup flows, checkout processes, onboarding sequences
Mixpanel Funnels - Dedicated funnel analytics PostHog Funnels - Open source funnel tracking
Cohort analysis
Group users by when they signed up or first took action. Compare retention, engagement, revenue across cohorts.
Reveals: Whether product is improving over time, which acquisition channels bring better users
Amplitude Cohorts - Guide to cohort analysis
A/B test analysis
Compare two versions to see which performs better. Requires understanding statistical significance.
Evan Miller A/B Tools - Sample size and significance calculators Optimizely Stats Engine - How A/B testing stats work
Time series analysis
Analyze data over time. Identify trends, seasonality, anomalies. Forecast future based on historical patterns.
Prophet by Facebook - Time series forecasting tool ARIMA models - Statistical forecasting method
Data Ethics and Privacy
Working with data comes with responsibility. User data isn't yours—it's entrusted to you.
Collect only what you need: Don't gather data "just in case." Have specific purpose for every data point collected.
Anonymize appropriately: Remove personally identifiable information when possible. Aggregate for analysis.
Secure data properly: Encrypt sensitive data, limit access, follow security best practices.
Be transparent: Users should know what data you collect and why. Privacy policies matter.
Respect regulations: GDPR, CCPA, and other privacy laws have teeth. Compliance isn't optional.
GDPR Compliance Checklist - European privacy regulation CCPA Compliance Guide - California privacy law
Getting Started with Data Analysis
Start with questions, not data. What do you want to know? What decisions would this inform?
Use existing tools before building custom solutions. Google Analytics, Plausible, Mixpanel handle most needs.
Begin with simple analysis. Basic metrics and trends teach you more than complex models applied prematurely.
Document your findings. Analysis is worthless if insights get lost. Share what you learn.
Resources for Learning Data Science
Kaggle - Datasets, competitions, tutorials, community DataCamp - Interactive data science courses Coursera Data Science Specialization - Johns Hopkins comprehensive program r/datascience - Community and resources
Data science isn't magic. It's asking good questions, finding relevant data, analyzing it honestly, and communicating findings clearly. Master those basics and you'll make better decisions than 90% of people building products.