I could easily study and write solely on the topic of Big Data. I could dive deep into every single Apache project and all the other software offerings, white papers, and technologies around big data and I’d have a lot to write about. The challenge with this is that we are not robots, we can’t […]
You up for learning something new? or maybe like me want to “do” statistics? If you are looking to do data analysis then R is a great tool/language to learn.
What is R?
R is a system for statistical computation and graphics. It consists of a language plus a run-time environment with graphics, a […]
In writing SQL queries one of the things you should really master is the use of wildcards. You will discover and probably hear it a million times that no data set is perfectly clean. There are always going to be issues where the spelling is wrong, or there are bad characters whatever […]
If you haven’t gotten a computer science degree you might not have spent much time on the topic of binary trees. Binary Trees are a key concept in the study of algorithms and data structures and are used to implement a binary search tree (BST) and binary heaps, as well as finding applications in efficient searching and sorting algorithms.
If you are looking to program in python it is essential you understand how to use packages to more than anything make your life easier. Packages are blocks of code that are written and made available for you to use in your code. Now one of the best and most widely used packages (really its […]
Ok, let’s assume its the January 1st again and you have your new year’s resolution list in front of you. Since we are assuming, let’s also assume that one of the items on your list is to learn how to program in python! That’s funny, since you want to learn python I happen to have […]
In most data projects or tasks there is at a very simplified level really 3 main steps:
Import/Connect to the Data Work with/Analyze the Data Present/Extract the Data and/or Information
One of the key skills that will prove critical is to be able to connect to all sorts of data sources. Here I will show […]
Continuing on from Python Programming Tutorial 4: Modules and Functions, here is the video for Python Programming Tutorial 5: How to Save Your Programs
For many Perl is an incredibly useful “Glue” language. Meaning that Perl is a great language for binding things together like converting a database into a spreadsheet ready file or taking word documents and converting them to HTML documents for the web. In this short post we will walk through the steps needed to install […]
This is a great explanation that you can find on Google Code University:
Serial vs. Parallel Programming
In the early days of computing, programs were serial, that is, a program consisted of a sequence of instructions, where each instruction executed one after the other. It ran from start to finish on a single […]
Please follow us :)5k
- Analytics (21)
- Big Data (9)
- Business Intelligence (59)
- Data Science (70)
- Miscellaneous (17)
Tags2008 Analysis Analytics Article Big Data Book Business Intelligence Charts Cognos Dashboards Data Data Visualization Data Warehouse Design Dimensional Fusion Tables Google Hadoop Humor IBM Logical Market Microsoft Model Modeling Operational Predictive Programming Python R Ralph Kimball Reporting Science Server SQL SQL Server SSIS Statistics TED Tools Tutorial Unstructured Video Visualization Warehousing