Data Analysis Journal

Data Analysis Journal

SQL

How To Get Randomly Distributed Users in SQL - Issue 123

Some simple and quick ways to apply sampling techniques for your analysis to get random users.

Olga Berezovsky's avatar
Olga Berezovsky
Dec 14, 2022
∙ Paid

I find it fascinating that today you can complete the full analysis lifecycle using only SQL without supplementing it with Python or Excel (without visualizations, of course). This is in lieu of when I also wrote about all the cool things you can do in SQL, from EDA to formatting, to forecasting, to advanced stats. 

Today’s topic is about how to apply different sampling techniques in SQL for your analysis - select a representative subset of data to identify patterns and trends in the larger data set. While SQL is not the best tool for getting randomly, uniformly, or normally distributed users (that’s where you would have to supplement with Python), you can make it work in SQL as well, and below, I’ll show you how.

When do you need to get randomly distributed users?

Keep reading with a 7-day free trial

Subscribe to Data Analysis Journal to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Olga Berezovsky · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture