CS 179G: Project in Computer Science

Bigdata Analysis

Fall 2021

 

Instructor: Vagelis Hristidis (aka Evangelos Christidis)

Discussion time: M 5-5:50 pm

Location:  Zoom

Office hour:  Mondays 4-5 pm, Zoom

--------

TA: Merlin Mao

Office hour:  Tuesdays 2:30-3:30 pm

 

Course Overview

This is a senior project course. We will talk about big data technologies in the class and about software development methodologies.

Students will form groups of 4 to work on the project.

Project Description

project

Grading

participation, attendance 5%

project 95%

Project Presentations

Each group will have about 8 minutes to present. All members must participate (speak). Send a link to your video to the instructor and the TA, which could include a combination of slides with  you talking and demo or mockup of demo. After each presentation we will have a short live Q&A session.

 

Tentative Discussion Schedule

Date   Topic
9/27   Class intro and project intro (slides)
10/4   Intro to Crawling, Social Network APIs, Spark, NoSQL
10/11   More on Spark, NoSQL
10/18   Agile Development: Scrum (slides)
10/25   Guest lecture 1: Joe Barr, Acronis SCS
11/1   Guest lecture 2: (Nani) Narayanan Srinivasan, mynd.co
11/8   Guest lecture 3: Mohiuddin Qader, Ebay
11/15   project presentations 1 (groups 1,2,3,4)
11/22 project presentations 2 (groups 5,7,8,9)
11/29   project presentations backup (if needed)

Other material

computer science ethics and impact (slides),  Software Development Cycles, Design, Testing (slides)

 

Useful Links

http://spark.apache.org/

http://www.tutorialspoint.com/apache_spark/ 

http://cassandra.apache.org/

http://hector-client.github.io/hector/build/html/index.html, https://github.com/zznate/hector-examples

https://dev.twitter.com/docs/streaming-apis

http://nutch.apache.org/

 

 

Policies

Academic Integrity:  http://conduct.ucr.edu/learnPolicies/Pages/AcademicIntegrity.aspx

Standards of Conduct: http://conduct.ucr.edu/learnPolicies/Pages/StandardsofConduct.aspx