The Central Limit Theorem (CLT) is a critical topic in statistics. Here are a handful of links and a video from Khan Academy to get you up to speed on the CLT.
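If you want to see the CLT in action before clicking through, here is a minimal sketch of my own (not taken from the linked material). It assumes an exponential population, which is heavily skewed, and the helper name `sample_means` is purely illustrative. The means of repeated samples should still cluster around the population mean of 1, with a spread close to 1/sqrt(sample_size).

```python
import random
import statistics


def sample_means(sample_size, num_samples, seed=0):
    """Draw num_samples samples from an exponential population (mean 1)
    and return the mean of each sample."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(sample_size))
        for _ in range(num_samples)
    ]


means = sample_means(sample_size=50, num_samples=2000)

# The CLT suggests these means are roughly normal, centered near 1.0,
# with a standard deviation near 1 / sqrt(50), about 0.141.
print("mean of sample means: ", round(statistics.fmean(means), 3))
print("stdev of sample means:", round(statistics.stdev(means), 3))
```

Try a larger `sample_size` and the spread of the sample means should shrink, even though the underlying population stays just as skewed.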
I love examples that are so blatantly clear that even people who aren’t as crazy about data as I am can get them. Anscombe’s Quartet is exactly that: it shows that we must look at data visually to be sure we really understand it.
Anscombe’s quartet comprises four datasets that have nearly identical simple statistical […]
I am slowly starting to get into Kaggle and eventually want to be one of the top competitors. That got me wondering: what do the top 10 competitors look like in terms of skill set? Well, here are the top 10 as of December 21st, 2013:
You can find the up-to-date list […]
I am a huge fan of trying to turn the complex, detailed, and unmanageable into a concise, usable, and understandable list or outline. I love seeing subjects like data architecture management, data modeling, programming languages, and even Data Science put into visual models that make it easier for others to see conceptually how they fit together. […]
Are you up for learning something new? Or maybe, like me, you want to “do” statistics? If you are looking to do data analysis, then R is a great tool/language to learn.
What is R?
R is a system for statistical computation and graphics. It consists of a language plus a run-time environment with graphics, a […]
I heard an interesting comment today from a Data Scientist that really got me thinking. She said, “We do Insight-to-Action Analytics and not Nice-to-Know analytics.” I admit it’s pretty catchy, but I had to stop for a second and think about what that really means.
Nice to know analytics being the analysis and deep […]
When writing SQL queries, one of the things you should really master is the use of wildcards. You will discover, and probably hear a million times, that no data set is perfectly clean. There are always going to be issues where the spelling is wrong, or there are bad characters whatever […]
If you don’t have a computer science degree, you might not have spent much time on the topic of binary trees. Binary trees are a key concept in the study of algorithms and data structures. They are used to implement binary search trees (BSTs) and binary heaps, and they show up in efficient searching and sorting algorithms.
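To make that concrete, here is a minimal sketch of a binary search tree in Python. It is my own illustration, assuming unique integer keys and no balancing; the names `Node`, `insert`, `contains`, and `in_order` are hypothetical helpers, not part of any library.

```python
class Node:
    """A single tree node holding a key and up to two children."""

    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None


def insert(root, key):
    """Insert key into the BST rooted at root and return the (new) root."""
    if root is None:
        return Node(key)
    if key < root.key:
        root.left = insert(root.left, key)
    elif key > root.key:
        root.right = insert(root.right, key)
    # Equal keys are ignored so the tree holds each key once.
    return root


def contains(root, key):
    """Walk down the tree, going left or right at each node."""
    while root is not None:
        if key == root.key:
            return True
        root = root.left if key < root.key else root.right
    return False


def in_order(root):
    """Return the keys in sorted order (left subtree, node, right subtree)."""
    if root is None:
        return []
    return in_order(root.left) + [root.key] + in_order(root.right)


root = None
for k in [8, 3, 10, 1, 6, 14]:
    root = insert(root, k)

print(in_order(root))
print(contains(root, 6), contains(root, 7))
```

Each comparison discards one whole subtree, which is where the efficient searching comes from, and the in-order walk is what ties the structure to sorting.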
If you are looking to program in Python, it is essential that you understand how to use packages, more than anything to make your life easier. Packages are blocks of code that are written and made available for you to use in your own code. Now one of the best and most widely used packages (really its […]
Sometimes it’s amazing how very simple concepts can actually be made so complicated, and for little reason. For those of us who don’t have a Stats background but want or need to understand statistical concepts, it doesn’t help that they are taught in a way that makes them hard to understand. Here I will […]