Enroll in the Data Scholar Canvas course here!
Tyler Prochnow MEd
HHPR Doctoral Student
Public Health Research Assistant
tprochnow.com
Joshua Been
Digital Scholarship Librarian

Data Viz: Network Visualizations Using Gephi
This workshop will cover the fundamentals of creating networks using Gephi
|
1. Introduction to networks 2. Importing network data 3. Preparing spreadsheet data for Gephi 4. Modify nodes and edges to visualize networks |
5. Measure network attributes, such as degree, diameter, betweeness, and modularity 6. Symbolize visualization using labels and colors 7. Export to sharable image |
| Tyler Prochnow MEd HHPR Doctoral Student Public Health Research Assistant tprochnow.com |
Joshua Been |
(1) Take Workshops, (2) Pass Quizzes, (3) Become a Data Scholar
| Interested in becoming a Data Scholar?
Takes only six workshops! |
Pick any Two Categories Below, Take at Least Two Workshops from Each of Those Categories: (Total of 4)
|
|
Pick any One Category Below, Take at Least Two Workshops from That Category:
(Total of 2)
|
* Becoming a Data Scholar is not mandatory. Take any workshop you like.

If you receive an error related to Cannot find Java 1.8 or higher, head to https://java.com/en/download/manual.jsp. One common cause of this error on Windows computers is the 32-bit version installed instead of the 64-bit. Windows users, make sure to download and install the 64-bit version.
Contents
|
Launch Gephi and Open les-mis.gexf
|
File/Open and select les-mis.gexf
|
| Explore Overview Tab |
Overview
|
|
Explore Data Laboratory Tab
|
Data Laboratory
|
| Adjust Node Color to Represent Gender Attribute | ![]() |
| Label Nodes by Attribute | ![]() |
|
Arrange Nodes
|
![]() ![]() |
|
|
|
![]() |
|
![]() ![]() |
|
Head to Data Laboratory (new fields)
|
![]() |
| Adjust Node Size Proportionately by Betweeness Centrality | ![]() |
| Rerun Noverlap and Label Adjust | ![]() |
| Take a quick screenshot | ![]() |
|
Document Similarity
Source Document: State of the Union Addresses (1790-2006) by United States. Presidents
Python Script via Google Colab |
TF-IDF: Term Frequency / Inverse Document Frequency Cosine Similarity: Similarity of the documents based on the TF-IDF values of all terms in the documents. |
| By default, every node is connected to every other node as the similarity score between all pairs of Addresses were calculated. | ![]() |
| Filter pairs of Addresses that have a similarity score of at least 0.3. | ![]() ![]() |
| Minimize edge thickness | ![]() |
|
Filter by Degree
|
![]() ![]() |
|
Size nodes by Betweeness Centrality
|
![]() ![]() |
|
Layout:
|
|
|
Run Modularity Statistical tool to identify communities within our data.
|
![]() ![]() |
| Set Node Partition color to Modularity Class | ![]() |
| Click Preview tab | ![]() |
| Click Refresh | |
|
Labels
|
![]() |
|
Click Refresh Click Export for image |
Copyright © Baylor® University. All rights reserved.
Report It | Title IX | Mental Health Resources | Anonymous Reporting | Legal Disclosures