Data Science Trivia
Data Science trivia highlights the people, methods, and milestones that shaped a discipline at the crossroads of statistics, computing, and real-world problem solving. From foundational concepts to playful facts about algorithms, data, and discovery, it offers a family-friendly mix of easy, funny, and more challenging questions grounded in the field's evolution.
Easy Data Science Trivia
13 questions
These easy Data Science trivia questions are great for beginners and kids around age 12 and under.
Question 1
Who mapped London cholera cases around the Broad Street pump in 1854?
Answer: John Snow
John Snow is known for mapping cholera cases in London around the Broad Street pump in 1854.
Question 2
Which pioneer used polar area diagrams to present mortality data from the Crimean War?
- A. Andrew Gelman
- B. Darrell Huff
- C. Florence Nightingale
- D. Hans Rosling
Answer: Florence Nightingale
Florence Nightingale used polar area diagrams to communicate mortality data from the Crimean War.
Question 3
Who wrote the 1925 book "Statistical Methods for Research Workers"?
Answer: Ronald Fisher
Ronald Fisher wrote the influential 1925 book "Statistical Methods for Research Workers."
Question 4
Who published "Computing Machinery and Intelligence" in 1950?
Answer: Alan Turing
Alan Turing published "Computing Machinery and Intelligence" in 1950.
Question 5
Claude Shannon worked at which famous research organization?
Answer: Bell Labs
Claude Shannon worked at Bell Labs.
Question 6
Which researcher is strongly associated with Statistical Learning Theory?
Answer: Vladimir Vapnik
Vladimir Vapnik is strongly associated with Statistical Learning Theory.
Question 7
Who co-founded Coursera in 2012?
Answer: Andrew Ng
Andrew Ng co-founded the online education platform Coursera in 2012.
Question 8
Which University of Toronto researcher shared the 2018 Turing Award for work on deep learning?
Answer: Geoffrey Hinton
Geoffrey Hinton shared the 2018 Turing Award.
Question 9
Which Montreal-based researcher shared the 2018 Turing Award with Geoffrey Hinton and Yann LeCun?
Answer: Yoshua Bengio
Yoshua Bengio shared the 2018 Turing Award.
Question 10
Who introduced generative adversarial networks in 2014?
Answer: Ian Goodfellow
Ian Goodfellow introduced generative adversarial networks in 2014.
Question 11
Which R package is Hadley Wickham known for creating?
Answer: ggplot2
Hadley Wickham created the ggplot2 package for R.
Question 12
Who co-founded the Gapminder Foundation?
Answer: Hans Rosling
Hans Rosling co-founded the Gapminder Foundation.
Question 13
Which statistics website was founded by Nate Silver?
Answer: FiveThirtyEight
Nate Silver founded the statistics website FiveThirtyEight.
Data Science Family Trivia
12 questions
These family Data Science trivia questions are built for mixed-age game nights, classrooms, and groups.
Question 1
In what year was Francis Anscombe's famous quartet first published?
Answer: 1973
Francis Anscombe's famous quartet was first published in 1973.
Question 2
How many data points are in each dataset of Anscombe's quartet?
Answer: 11
All four datasets in Anscombe's quartet contain 11 data points each.
Question 3
Who wrote the book "The Visual Display of Quantitative Information"?
Answer: Edward Tufte
Edward Tufte is the author of "The Visual Display of Quantitative Information."
Question 4
Which member of Hans Rosling's family helped co-author "Factfulness" with him?
Answer: Ola Rosling
Hans Rosling's son Ola Rosling is one of the co-authors of "Factfulness."
Question 5
Besides Hans Rosling and Ola Rosling, which co-author helped write "Factfulness"?
Answer: Anna Rosling Rönnlund
Anna Rosling Rönnlund is a co-author of "Factfulness."
Question 6
Who popularized the term "debugging" after a moth was found in a computer relay?
Answer: Grace Hopper
Grace Hopper is famous for popularizing the term "debugging" after a moth was found in a relay.
Question 7
True or false: The 2016 film "Hidden Figures" follows mathematicians who worked for NASA.
Answer: True
"Hidden Figures" is about mathematicians who worked for NASA.
Question 8
If you visited the Jet Propulsion Laboratory, which U.S. state would you be in?
Answer: California
The Jet Propulsion Laboratory is in California.
Question 9
Los Alamos National Laboratory is located in which state?
Answer: New Mexico
Los Alamos National Laboratory is in New Mexico.
Question 10
Which state is home to Oak Ridge National Laboratory?
Answer: Tennessee
Oak Ridge National Laboratory is in Tennessee.
Question 11
Bletchley Park is in which country?
Answer: England
Bletchley Park is located in England.
Question 12
The University of Cambridge is located in which city and country?
Answer: Cambridge, England
The University of Cambridge is in Cambridge, England.
Fun Data Science Trivia
13 questions
These fun Data Science trivia questions highlight surprising moments and playful facts for game-night groups.
Question 1
Which statistics classic starts with the wonderfully suspicious word "How"?
Answer: How to Lie with Statistics
Darrell Huff's best-known statistics book is titled "How to Lie with Statistics," and its title indeed begins with "How."
Question 2
Which book snagged the 2012 National Academies Communication Award?
- A. Thinking, Fast and Slow
- B. The Signal and the Noise
- C. Moneyball
- D. How to Lie with Statistics
Answer: Thinking, Fast and Slow
"Thinking, Fast and Slow" won the 2012 National Academies Communication Award.
Question 3
Who created the TeX typesetting system, giving technical writing a very precise personality?
Answer: Donald Knuth
Donald Knuth created the TeX typesetting system.
Question 4
During World War II, who made major contributions to sequential analysis?
Answer: Abraham Wald
Abraham Wald made major contributions to sequential analysis during World War II.
Question 5
What term did Arthur Samuel coin in 1959?
Answer: Machine Learning
The term "machine learning" was coined by Arthur Samuel in 1959.
Question 6
Which came first: the perceptron or the coining of the term "machine learning"?
Answer: The perceptron
The perceptron was introduced in 1958, while "machine learning" was coined in 1959, so the perceptron came first.
Question 7
Frank Rosenblatt introduced which early learning model in 1958?
Answer: perceptron
The perceptron was introduced by Frank Rosenblatt in 1958.
Question 8
Which method was described in 1951 by Evelyn Fix and Joseph Hodges?
Answer: k-nearest neighbors
K-nearest neighbors was described by Evelyn Fix and Joseph Hodges in 1951.
Question 9
A 1977 paper by Dempster, Laird, and Rubin formalized what algorithm?
Answer: expectation-maximization algorithm
The expectation-maximization algorithm was formalized in a 1977 paper by Dempster, Laird, and Rubin.
Question 10
Which method did Leo Breiman introduce in 2001?
Answer: random forest
Leo Breiman introduced the random forest method in 2001.
Question 11
Which algorithm was developed by Larry Page and Sergey Brin?
Answer: PageRank
PageRank was developed by Larry Page and Sergey Brin.
Question 12
What competition put a cool $1 million on the table for improving movie recommendation accuracy?
Answer: Netflix Prize
The Netflix Prize offered $1 million for improving movie recommendation accuracy.
Question 13
Who is known for public outreach on both data science and astronomy?
Answer: Kirk Borne
Kirk Borne is known for public outreach on data science and astronomy.
Funny Data Science Trivia
13 questions
These funny Data Science trivia questions highlight playful moments, odd facts, and inside jokes.
Question 1
What classic data-analysis lesson says your chart should get a turn before your hot take does?
Answer: Always plot your data.
A famous lesson in data analysis is the reminder to always visualize the data rather than relying only on summaries.
Question 2
Which computing-era warning from the 1950s politely says bad input leads to bad output, even if the computer looks very confident?
Answer: garbage in, garbage out
The phrase 'garbage in, garbage out' was popularized in computing in the 1950s.
Question 3
Which language was created by Ross Ihaka and Robert Gentleman in the 1990s, giving statisticians yet another reason to argue about plotting defaults?
Answer: R
R was created by Ross Ihaka and Robert Gentleman in the 1990s.
Question 4
Guido van Rossum first released which language in 1991, eventually enabling many notebooks, scripts, and suspiciously optimistic comments?
Answer: Python
Python was first released by Guido van Rossum in 1991.
Question 5
Which Python library created by Wes McKinney became famous for making tables slightly less scary?
Answer: pandas
The pandas library for Python was created by Wes McKinney.
Question 6
What plotting package took its name from the phrase 'Grammar of Graphics,' as if charts needed their own textbook title?
Answer: ggplot2
The name ggplot2 comes from the 'Grammar of Graphics.'
Question 7
Which Python visualization library was created by Michael Waskom, helping many people make prettier plots than their first draft deserved?
Answer: seaborn
The seaborn visualization library for Python was created by Michael Waskom.
Question 8
Which package name is literally short for 'Numerical Python,' in case the full title felt too committed?
Answer: NumPy
NumPy is short for Numerical Python.
Question 9
Which package name expands to 'Scientific Python,' sounding exactly as serious as it intends to?
Answer: SciPy
SciPy is short for Scientific Python.
Question 10
What term describes searching through data until something finally looks significant enough to wave around dramatically?
Answer: data dredging
Data dredging refers to searching for patterns until something looks significant.
Question 11
Finish the standard statistics warning: correlation does not imply _____.
Answer: causation
The standard warning is 'correlation does not imply causation.'
Question 12
Nassim Nicholas Taleb popularized which lesson about misleading trends, starring a bird whose confidence is not matched by its timeline?
Answer: the turkey illusion
The 'turkey illusion,' popularized by Nassim Nicholas Taleb, is a lesson about misleading trends.
Question 13
Who is credited with the line 'All models are wrong, but some are useful,' a quote that manages to be rude and helpful at the same time?
Answer: George Box
The quote is attributed to statistician George Box.
Hard Data Science Trivia
14 questions
These hard Data Science trivia questions are for expert fans who want a real challenge.
Question 1
Which method became widely known after the 1995 work of Cortes and Vapnik?
Answer: Support vector machine
The support vector machine became widely known after the 1995 work of Cortes and Vapnik.
Question 2
What was the publication year of the backpropagation paper that reignited interest in neural networks?
Answer: 1986
The backpropagation paper that renewed interest in neural networks was published in 1986.
Question 3
In what year did the ImageNet Large Scale Visual Recognition Challenge begin?
Answer: 2010
The challenge began in 2010.
Question 4
Which model won the ImageNet competition in 2012?
Answer: AlexNet
AlexNet won the ImageNet competition in 2012.
Question 5
What system defeated Lee Sedol in Seoul in 2016?
Answer: AlphaGo
DeepMind's AlphaGo defeated Lee Sedol in Seoul in 2016.
Question 6
Which density-based clustering algorithm was introduced in 1996?
Answer: DBSCAN
DBSCAN was introduced in 1996.
Question 7
Robert Tibshirani introduced which method in 1996?
Answer: LASSO
Robert Tibshirani introduced LASSO in 1996.
Question 8
Which resampling method did Bradley Efron introduce in 1979?
Answer: Bootstrap
Bradley Efron introduced the bootstrap in 1979.
Question 9
What method was proposed by Maurice Quenouille before the bootstrap arrived?
Answer: Jackknife
The jackknife was proposed by Maurice Quenouille before the bootstrap.
Question 10
Name the algorithm that was published in 1953 and later became foundational in Monte Carlo methods.
Answer: Metropolis algorithm
The Metropolis algorithm was published in 1953.
Question 11
What optimization theorem was published by Wolpert and Macready in 1997?
Answer: No Free Lunch theorem
Wolpert and Macready published the No Free Lunch theorem for optimization in 1997.
Question 12
The ROC curve was originally developed in connection with what wartime application area?
Answer: Radar signal detection during World War II
The ROC curve was developed for radar signal detection during World War II.
Question 13
In model evaluation shorthand, what does AUC stand for?
Answer: Area under the curve
AUC stands for area under the curve.
Question 14
Chronology check: which came first, the Metropolis algorithm or the Gibbs sampler entering mainstream statistics?
Answer: The Metropolis algorithm came first.
The Metropolis algorithm was published in 1953, while the Gibbs sampler entered mainstream statistics through a 1990 paper.