CS 179G: Project in Computer Science
Bigdata Analysis
Fall 2022
Instructor: Vagelis Hristidis (aka Evangelos Christidis)
Discussion time: M 5-5:50 pm
Location: Watkins Hall | Room 2240
Office hour: Mondays 4-5 pm at WCH 317
--------
TA: Shihab Rashid
Course Overview
This is a senior project course. We will talk about big data technologies in the class and about software development methodologies.
Students will form groups of 4 to work on the project.
Project Description
Grading
participation, attendance 5%
project 95%
Project Presentations
Each group will have about 8 minutes to present. All members must participate (speak). Will probably do the presentations in the Finals week.
Tentative Discussion Schedule
Date | Topic | |
9/26 | Class intro and project intro (slides) | |
10/3 | Intro to Crawling, Social Network APIs, Spark, NoSQL | |
10/10 | More on Spark, NoSQL | |
10/17 | Spark, NoSQL (cont'd) | |
10/24 | Guest lecture 1: (Nani) Narayanan Srinivasan, mynd.co, remote | |
10/31 | Agile Development: Scrum (slides) | |
11/7 | Cassandra and LSM storage (slides) | |
11/14 | Guest lecture 2: Moloud Shahbazi, PhD, Apple, remote | |
11/21 | prepare final report (no lecture) | |
11/28 | prepare presentation (no lecture) | |
11/30 | project presentations at 6 pm in WCH 143 during Lab Section |
Other material
computer science ethics and impact (slides), Software Development Cycles, Design, Testing (slides)
Useful Links
http://www.tutorialspoint.com/apache_spark/
http://hector-client.github.io/hector/build/html/index.html, https://github.com/zznate/hector-examples
https://dev.twitter.com/docs/streaming-apis
Policies
Academic Integrity: http://conduct.ucr.edu/learnPolicies/Pages/AcademicIntegrity.aspx
Standards of Conduct: http://conduct.ucr.edu/learnPolicies/Pages/StandardsofConduct.aspx