Skip to main content

The Makings of a Data Enthusiast Community


Let's Talk Data

If a tree falls in the forest, can anyone hear it…?
If data is perfectly analyzed but no one sees it, does it still impact our world…?


To be a thought leader in the data community is just as important as building technical skills.  So talking about data with others is an important part of what data professionals do.  That is why we see so many amazing platforms like Stack Overflow and blogs like Towards Data Science.  Sure, you will find a lot of data and projects featured on this site but what makes them dynamic is the conversations around the data and coding.  I hope this site can be one of those communities; a more intimate version.

Data in Its Purest Form

Most people think of data as little bits of information that we collect.  I tend to think of data as containers for questions just waiting to be answered.  The best in the business will always tell us that good data will make us "go look" or ask "why" or say "tell me more".

Beyond Raw Data

Brené Brown is one of my true inspirations.  She is a researcher who tackles tough social issues that others shy away from and has an unprecedented following.  In her book, The Power of Vulnerability, she explains that a story is data with a soul.  In the world of data analytics, we strive to take data and turn them into stories.  But how does this happen?  How do you take rows of data and turn it into a meaningful story that others actually care about when your presentation is done?  This is the goal that anyone who wrangles data faces.  My goal is to take those stories built from insights and solve problems creatively.  And hopefully, along the way, I will find myself nestled in a community of other data lovers on the same journey.

What will you do with the data you find?

Comments

Popular posts from this blog

The Many Faces of Data Visualization

The Many Faces of Data Visualization By Rich Hunzinger Data Visualization has become one of the common "buzz" phrases swirling around the internet these days. With all of the promises of Big Data and the IoT (Internet of Things), more organizations are making an effort to get more value from the voluminous data they generate. This frequently involves complex analysis - both real time and historical - combined with automation. A key factor in translating this data into actionable information, and thusly into informed action, is the means by which this data is visualized. Will it be seen in real time? And by whom? Will it be displayed in colorful bubble charts and trend graphs? Or will it be embedded in high-detail 3D graphics? What is the goal of the visualization? Is it to share information? Enable collaboration? Empower decision-making? Data visualization might be a popular concept, but we don't all have the same idea about what it means. For many organizations...

Using Data Effectively & Creatively to Protect Our Elderly Population

Exploring the Root Cause of Medicare Claims Fraud Interoperability has become the new buzz word within healthcare.  The goal is admirable: use computer systems to easily exchange data.  It sounds so simple.  We are even at a place with machine learning and artificial intelligence where all kinds of information can be easily shared.  So it makes me stop to think, why is Medicare fraud still so prevalent.  If we are sharing information and we can detect patterns, then why are we still having this issue.  It seems that the answer lies in the fact that while the capability exists, the willingness to share data across the right platforms still is not being done.  One carrier may have data within its organization while one regulatory body may internally share information and on the other hand providers respectively share information among one another.  Yet, if the American healthcare system is not a system at all, in that all the parts do not integr...

Data Wrangling — Some Tips during this covid season

In this covid season, thought of sharing some tips and sample codes on Data Wrangling. After getting a glance of Google Colab, my go to language for data wrangling has been Python. This blog hence uses Colab and Pandas on Python for data wrangling. Most of the data collected has format related issues because they are collected without the use of a software. These days, with tools like Google Forms coming into picture, it is much better, but when you have old data to deal with in its row form, there is no escape from formatting it and getting it to the shape that you need it to be in. This blog shows how to use colab, pandas and lambda functions to quickly format some data. I had written an earlier blog few days back on  Colab  and how to import data from excel to Colab. So, I shall skip that part now. Let us say, the below is the data that we need to format (the admission number column) and we need all the data to be in the format: YYYY-Num We can do this quickly using datafra...