"Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains.", Wikipedia.
We will walk through examples focusing on both structured and unstructured data.
|Structured Data||Unstructured Data|
We will explore two data platforms: Microsoft Power BI and Google Colab
Our favorite places and our favorite foods!
Fill out the following Excel file
We will build from scratch a data-driven dashboard using this structured data. We will use Microsoft Power BI to design the dashboard.
Power Query M (Preprocess Data):
Power BI Visuals:
Unstructured Data: Freedom Narratives to KJV Fuzzy Matches
Open Source Tool on Colab: https://colab.research.google.com/drive/1HcMsxk7zhEU-AKtasR-435a7L3LlpiPL?usp=sharing
Mine Tweets Mentioning Baylor
Tool to Collect Tweets: Chrome browser extension NCapture
Tool to Mine Tweets: NVivo