Exploring Big Data Through the Twitterverse – CF013

This week Ashton and Christian begin an exciting, multi-part series on how to leverage big data technologies to learn new skills in data analytics using one of the world’s cheapest and largest value data sources – tweets! We take a look at the Twitter Streaming API, techniques for collecting and storing these data sets, and basic visualization exercises in Java and HTML5. We also take a look at some automated Twitter interactions using Python scripting and basic rule sets. This is a great show to follow along with on our video feed if you typically are an audio-only listener. We’ll pick-up with how this project integrates with the data analytics sandbox we built and discussed in previous episodes. There is something for everyone in this conversation.

Cyber Frontiers is all about Exploring Cyber security, Big Data, and the Technologies Shaping the Future Through an Academic Perspective!   Christian Johnson, a student at the University of Maryland will bring fresh and relevant topics to the show based on the current work he does.

Support the Average Guy Tech Scholarship Fund: https://www.patreon.com/theaverageguy

WANT TO SUBSCRIBE? We now have Video Large / Small and Video iTunes options at http://theAverageGuy.tv/subscribe

You can contact us via email at jim@theaverageguy.tv or call in your questions or comments to be played on the show at (402) 478-8450

Listen Mobile:

 



This week Christian, Ashton, and their host Jim take a look at some of the opportunities to process Tweets using real time analysis.  Demonstrations include a letter count for tweets as well as a geolocation application, illustrating a small example of how Tweets can be used to create interesting pictures of the world around us.

Register an application here so that you can gain access to the API.

https://dev.twitter.com/

The Java library to interface with the Twitter API.

http://twitter4j.org/en/index.html

This article was particularly helpful in coming up with the storm topology.

http://data.linkedin.com/blog/2011/02/build-a-distributed-realtime-tweet-search-system-in-no-time-part-12

 


Jim’s Twitter: http://twitter.com/#!/jcollison

Contact Christian: christian@theaverageguy.tv

Contact the show at jim@theaverageguy.tv

Find this and other great Podcasts from the Average Guy Network at http://theaverageguy.tv

Music courtesy of Ryan King. Check out the Die Hard Cafe band and other original works at:
http://diehardcafe.bandcamp.com/http://cokehabitgo.tumblr.com/tagged/my-music