Enroll in the Data Scholar Canvas course here!
The quiz for this workshop is listed under Finding Secondary Data
Click Here for Workshop Schedule
(1) Take Workshops, (2) Pass Quizzes, (3) Become a Data Scholar
Interested in becoming a Data Scholar?
It takes only six workshops!
Pick any Two Categories Below, Take at Least Two Workshops from Each of Those Categories: (Total of 4)
Pick any One Category Below, Take at Least Two Workshops from That Category: (Total of 2)
* Becoming a Data Scholar is not mandatory. Take any workshop you like.
Workshop participants will use NCapture to extract recent Twitter content. We will then import the content into NVivo, where we will explore maps, wordclouds, and themes.
Recent Tweets (5-7 days, up to 18,000 Tweets): Advised Method
NCapture is a free Chrome extension that lets users mine recent Twitter data: Tweets that include a particular word, phrase, or hashtag, or Tweets by a particular user. Data is stored in Chrome's default download directory. NVivo is required to access and analyze the data. The advantage of NCapture is that the captured data can be analyzed directly in NVivo.
Recent Tweets (5-7 days, up to 18,000 Tweets): Alternative Method
Basic (free) version allows up to 2,000 Tweets.
Historic Twitter Data
Archive Team: The Twitter Stream Grab
“A simple collection of JSON grabbed from the general twitter stream, for the purposes of research, history, testing and memory. This is the “Spritzer” version, the most light and shallow of Twitter grabs. Unfortunately, we do not currently have access to the Sprinkler or Garden Hose versions of the stream.”
Monthly archives are tarballs (.tar) containing hourly Tweet archives compressed as bzip2 files (.bz2). Uncompressed archives are in the standard Twitter JSON format and contain all fields.
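As a rough illustration of the archive layout described above, the following sketch walks a directory of extracted .bz2 files and yields one Tweet at a time. It assumes the monthly .tar has already been extracted, that each .bz2 file holds one JSON object per line, and that records without a `text` field are non-Tweet events that can be skipped. The function name `iter_tweets` and the example path are hypothetical and are not part of any Baylor or Archive Team tool.

```python
import bz2
import json
import os
from typing import Iterator


def iter_tweets(top_dir: str) -> Iterator[dict]:
    """Yield one dict per Tweet from every .bz2 file found under top_dir."""
    for root, dirs, files in os.walk(top_dir):
        dirs.sort()  # visit subdirectories in a predictable order
        for name in sorted(files):
            if not name.endswith(".bz2"):
                continue
            path = os.path.join(root, name)
            # "rt" decompresses and decodes to text in one step
            with bz2.open(path, "rt", encoding="utf-8") as handle:
                for line in handle:
                    line = line.strip()
                    if not line:
                        continue
                    try:
                        record = json.loads(line)
                    except json.JSONDecodeError:
                        continue  # skip truncated or malformed lines
                    # Assumption: only records with a "text" field are Tweets
                    if isinstance(record, dict) and "text" in record:
                        yield record


if __name__ == "__main__":
    # Hypothetical path to an extracted monthly archive
    for tweet in iter_tweets("twitter-stream-extracted"):
        print(tweet["text"])
        break
```

Reading the files as a stream keeps memory use low, which matters because a month of archives can hold a very large number of Tweets.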
Baylor Libraries Python Script to Mine Archive Team: The Twitter Stream Grab content |
Requirements: Anaconda Python
Video Walk-Through: Stream Video (no audio)
The Archive Team: The Twitter Stream Grab provides historic downloads of Twitter archives by month. This script helps researchers mine this content for a list of keywords or hashtags. Output is a .csv file containing one record per relationship. Relationships are classified as (1) reply, (2) mention, or (3) tweet. A reply is a direct response to another user's post. A mention is a Tweet that names another user but is not a direct reply. A tweet relationship is a Tweet with neither replies nor mentions. See the modify section below to specify (1) keywords/hashtags, (2) top-level directory, and (3) output file name.
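The reply, mention, and tweet classification described above can be sketched roughly as follows. This is not the Baylor Libraries script itself; it is a minimal illustration that assumes each Tweet is a dict in the standard Twitter JSON format (with `user.screen_name`, `in_reply_to_screen_name`, and `entities.user_mentions`). The keyword list, output file name, CSV column names, and function names are all hypothetical.

```python
import csv
from typing import Iterable, Iterator, List, Tuple

KEYWORDS = ["#sicem", "baylor"]    # hypothetical keywords/hashtags
OUTPUT_FILE = "relationships.csv"  # hypothetical output file name


def matches_keywords(tweet: dict, keywords: List[str]) -> bool:
    """True if the Tweet text contains any keyword (case-insensitive)."""
    text = (tweet.get("text") or "").lower()
    return any(keyword.lower() in text for keyword in keywords)


def relationships(tweet: dict) -> Iterator[Tuple[str, str, str]]:
    """Yield (source, target, kind) tuples, one per relationship."""
    source = (tweet.get("user") or {}).get("screen_name", "")
    reply_to = tweet.get("in_reply_to_screen_name")
    mentions = (tweet.get("entities") or {}).get("user_mentions") or []
    found = False
    if reply_to:
        found = True
        yield (source, reply_to, "reply")
    for mention in mentions:
        name = mention.get("screen_name")
        # The replied-to user is usually also listed as a mention; count it once
        if name and name != reply_to:
            found = True
            yield (source, name, "mention")
    if not found:
        # Neither a reply nor a mention: record the Tweet on its own
        yield (source, "", "tweet")


def write_relationships(tweets: Iterable[dict], keywords: List[str],
                        out_path: str) -> int:
    """Write one CSV row per relationship and return the number of rows."""
    count = 0
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["source", "target", "relationship"])
        for tweet in tweets:
            if matches_keywords(tweet, keywords):
                for row in relationships(tweet):
                    writer.writerow(row)
                    count += 1
    return count
```

Because one Tweet can reply to one user and mention several others, a single Tweet may produce more than one row, which is consistent with the one-record-per-relationship output described above.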
Since 2016, Facebook has locked down much of its content, making it difficult to mine for research purposes.
What is accessible?
Workshop participants will mine Reddit text and images by subreddit.
https://colab.research.google.com/drive/1zPGTNCXR3NCR798t5si-15uu7vXI3LrK
Cheat Code: https://researchguides.baylor.edu/c.php?g=980986
Participants will mine Instagram images by hashtag
Mine Instagram by Hashtag - Tool created by Baylor University Libraries
https://colab.research.google.com/drive/13GLjXP8TGD11wE5w5EwIQlLjBEvj226u
Copyright © Baylor® University. All rights reserved.