CS 179G: Project in Computer Science

Big Data Analysis

Fall 2023

 

Instructor: Vagelis Hristidis (aka Evangelos Christidis)

Discussion time: Wednesdays 5:00-5:50 pm

Location: Materials Science and Engineering, Room 103

Office hours: Wednesdays 4-5 pm in WCH 363

--------

TA: Shihab Rashid

 

Course Overview

This is a senior project course. In class, we will discuss big data technologies and software development methodologies.

Students will form groups of 4 to work on the project.

Project Description

project

Grading

Participation and attendance: 5%

Project: 95%

Project Presentations

Each group will have about 8 minutes to present. All members must participate (speak). Presentations will probably take place during finals week.

 

Tentative Discussion Schedule

Date    Topic
10/4    Class intro and project intro
10/11   Big data sources, Spark
10/18   Spark (cont'd)
10/25   NoSQL, Cassandra
11/1    Guest lecture 1: Mohiuddin Qader, eBay
11/8    Agile development: Scrum, privacy and security considerations
11/15   Guest lecture 2: Merlin Mao, ByteDance
11/22   No lecture: prepare report and presentation
11/29   Presentations
12/6    Presentations

Other material

Computer science ethics and impact (slides); software development cycles, design, and testing (slides)

 

Useful Links

http://spark.apache.org/

http://www.tutorialspoint.com/apache_spark/ 

http://cassandra.apache.org/

http://hector-client.github.io/hector/build/html/index.html, https://github.com/zznate/hector-examples

https://dev.twitter.com/docs/streaming-apis

http://nutch.apache.org/


Policies

Academic Integrity: http://conduct.ucr.edu/learnPolicies/Pages/AcademicIntegrity.aspx

Standards of Conduct: http://conduct.ucr.edu/learnPolicies/Pages/StandardsofConduct.aspx