GitHub - jefferickson/two-americas: Capstone Project, Hunter College Department of Mathematics and Statistics

"Two Americas" Tweet Collector

Author: Jeff Erickson `<[email protected]>`

Date: 2018-05-21

Overview

A system to collect tweets from every county in the contiguous United States on a selection of politically relevant search topics.

This system was used to collect data for my Master's capstone project at the Department of Mathematics and Statistics, Hunter College, The City University of New York.

Method of Search

The purpose of this system is to collect tweets systematically from around the United States. The search terms themselves are defined in advance: see datasets/topics.csv.

The Twitter API lets a user restrict a search to a "search circle", specified by a latitude, a longitude, and a radius. To approximate each county, a "minimum bounding search circle" has been defined for it: see datasets/county_listing.csv.
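As an illustration of how a search circle can be passed to the API, here is a minimal sketch in Go. It assumes the standard search endpoint's geocode parameter, which takes the form "latitude,longitude,radius" with the radius suffixed by mi or km. The SearchCircle type and its field names are hypothetical, and the coordinates are illustrative values rather than a row from datasets/county_listing.csv.

```go
package main

import (
	"fmt"
	"strconv"
)

// SearchCircle is a hypothetical type holding one county's bounding circle.
type SearchCircle struct {
	Lat      float64 // latitude of the circle's center, in degrees
	Lon      float64 // longitude of the circle's center, in degrees
	RadiusMi float64 // radius of the circle, in miles
}

// Geocode formats the circle as "lat,lon,radiusmi", the shape expected by
// the geocode parameter of the standard search endpoint.
func (c SearchCircle) Geocode() string {
	f := func(v float64) string { return strconv.FormatFloat(v, 'f', -1, 64) }
	return f(c.Lat) + "," + f(c.Lon) + "," + f(c.RadiusMi) + "mi"
}

func main() {
	// Illustrative values only, not taken from the county listing.
	c := SearchCircle{Lat: 40.7685, Lon: -73.9646, RadiusMi: 12.5}
	fmt.Println(c.Geocode())
}
```

Because a circle that encloses a whole county generally extends past its borders, circles for neighboring counties can overlap, so the same tweet may be returned for more than one county.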

Data Storage

Data is stored in a MongoDB NoSQL database.

Setup and Usage

Get dependencies:

go get github.com/ChimeraCoder/anaconda
go get github.com/globalsign/mgo
go get gopkg.in/cheggaaa/pb.v1

Get the repo and install:

go get github.com/jefferickson/two-americas
go install github.com/jefferickson/two-americas

Get Twitter API credentials. Copy the file .env.example to .env and fill in these credentials, along with the connection details for MongoDB.

Source the .env file and run the service:

source .env
$GOPATH/bin/two-americas

The service keeps running until you stop it.

When ready, extract your data. A modifiable utility script (see extract/toTSV.go) is provided to export the data in a "tidy" format. Once you have specified the fields you want to export, run:

go run $GOPATH/src/github.com/jefferickson/two-americas/extract/toTSV.go

