Enroll in the Data Scholar Canvas course here!
Tyler Prochnow MEd
HHPR Doctoral Student
Public Health Research Assistant
tprochnow.com
Joshua Been
Digital Scholarship Librarian
Data Viz: Network Visualizations Using Gephi
This workshop will cover the fundamentals of creating networks using Gephi
1. Introduction to networks 2. Importing network data 3. Preparing spreadsheet data for Gephi 4. Modify nodes and edges to visualize networks |
5. Measure network attributes, such as degree, diameter, betweeness, and modularity 6. Symbolize visualization using labels and colors 7. Export to sharable image |
Tyler Prochnow MEd HHPR Doctoral Student Public Health Research Assistant tprochnow.com |
Joshua Been |
(1) Take Workshops, (2) Pass Quizzes, (3) Become a Data Scholar
Interested in becoming a Data Scholar?
Takes only six workshops! |
Pick any Two Categories Below, Take at Least Two Workshops from Each of Those Categories: (Total of 4)
|
|
Pick any One Category Below, Take at Least Two Workshops from That Category:
(Total of 2)
|
* Becoming a Data Scholar is not mandatory. Take any workshop you like.
If you receive an error related to Cannot find Java 1.8 or higher, head to https://java.com/en/download/manual.jsp. One common cause of this error on Windows computers is the 32-bit version installed instead of the 64-bit. Windows users, make sure to download and install the 64-bit version.
Contents
Launch Gephi and Open les-mis.gexf
|
File/Open and select les-mis.gexf |
Explore Overview Tab |
Overview
|
Explore Data Laboratory Tab
|
Data Laboratory |
Adjust Node Color to Represent Gender Attribute | |
Label Nodes by Attribute | |
Arrange Nodes
|
|
|
|
|
|
|
|
Head to Data Laboratory (new fields)
|
|
Adjust Node Size Proportionately by Betweeness Centrality | |
Rerun Noverlap and Label Adjust | |
Take a quick screenshot |
Document Similarity
Source Document: State of the Union Addresses (1790-2006) by United States. Presidents
Python Script via Google Colab |
TF-IDF: Term Frequency / Inverse Document Frequency Cosine Similarity: Similarity of the documents based on the TF-IDF values of all terms in the documents. |
By default, every node is connected to every other node as the similarity score between all pairs of Addresses were calculated. | |
Filter pairs of Addresses that have a similarity score of at least 0.3. | |
Minimize edge thickness | |
Filter by Degree
|
|
Size nodes by Betweeness Centrality
|
|
Layout:
|
|
Run Modularity Statistical tool to identify communities within our data.
|
|
Set Node Partition color to Modularity Class | |
Click Preview tab | |
Click Refresh | |
Labels
|
|
Click Refresh Click Export for image |
Copyright © Baylor® University. All rights reserved.
Report It | Title IX | Mental Health Resources | Anonymous Reporting | Legal Disclosures