CS 179G: Project in Computer Science
Bigdata Analysis
Fall 2021
Instructor: Vagelis Hristidis (aka Evangelos Christidis)
Discussion time: M 5-5:50 pm
Location: Zoom
Office hour: Mondays 4-5 pm, Zoom
--------
TA: Merlin Mao
Office hour: Tuesdays 2:30-3:30 pm
Course Overview
This is a senior project course. We will talk about big data technologies in the class and about software development methodologies.
Students will form groups of 4 to work on the project.
Project Description
Grading
participation, attendance 5%
project 95%
Project Presentations
Each group will have about 8 minutes to present. All members must participate (speak). Send a link to your video to the instructor and the TA, which could include a combination of slides with you talking and demo or mockup of demo. After each presentation we will have a short live Q&A session.
Tentative Discussion Schedule
Date | Topic | |
9/27 | Class intro and project intro (slides) | |
10/4 | Intro to Crawling, Social Network APIs, Spark, NoSQL | |
10/11 | More on Spark, NoSQL | |
10/18 | Agile Development: Scrum (slides) | |
10/25 | Guest lecture 1: Joe Barr, Acronis SCS | |
11/1 | Guest lecture 2: (Nani) Narayanan Srinivasan, mynd.co | |
11/8 | Guest lecture 3: Mohiuddin Qader, Ebay | |
11/15 | project presentations 1 (groups 1,2,3,4) | |
11/22 | project presentations 2 (groups 5,7,8,9) | |
11/29 | project presentations backup (if needed) |
Other material
computer science ethics and impact (slides), Software Development Cycles, Design, Testing (slides)
Useful Links
http://www.tutorialspoint.com/apache_spark/
http://hector-client.github.io/hector/build/html/index.html, https://github.com/zznate/hector-examples
https://dev.twitter.com/docs/streaming-apis
Policies
Academic Integrity: http://conduct.ucr.edu/learnPolicies/Pages/AcademicIntegrity.aspx
Standards of Conduct: http://conduct.ucr.edu/learnPolicies/Pages/StandardsofConduct.aspx