CS 179G: Project in Computer Science

Bigdata Analysis

Fall 2022

 

Instructor: Vagelis Hristidis (aka Evangelos Christidis)

Discussion time: M 5-5:50 pm

Location:  Watkins Hall | Room 2240

Office hour:  Mondays 4-5 pm at WCH 317

--------

TA: Shihab Rashid

 

Course Overview

This is a senior project course. We will talk about big data technologies in the class and about software development methodologies.

Students will form groups of 4 to work on the project.

Project Description

project

Grading

participation, attendance 5%

project 95%

Project Presentations

Each group will have about 8 minutes to present. All members must participate (speak). Will probably do the presentations in the Finals week.

 

Tentative Discussion Schedule

Date   Topic
9/26   Class intro and project intro (slides)
10/3   Intro to Crawling, Social Network APIs, Spark, NoSQL
10/10   More on Spark, NoSQL
10/17   Spark, NoSQL (cont'd)
10/24   Guest lecture 1: (Nani) Narayanan Srinivasan, mynd.co, remote
10/31   Agile Development: Scrum (slides)
11/7   Cassandra and LSM storage (slides)
11/14   Guest lecture 2: Moloud Shahbazi, PhD, Apple, remote
11/21 prepare final report (no lecture)
11/28   prepare presentation (no lecture)
11/30   project presentations at 6 pm in WCH 143 during Lab Section

Other material

computer science ethics and impact (slides),  Software Development Cycles, Design, Testing (slides)

 

Useful Links

http://spark.apache.org/

http://www.tutorialspoint.com/apache_spark/ 

http://cassandra.apache.org/

http://hector-client.github.io/hector/build/html/index.html, https://github.com/zznate/hector-examples

https://dev.twitter.com/docs/streaming-apis

http://nutch.apache.org/

 

 

Policies

Academic Integrity:  http://conduct.ucr.edu/learnPolicies/Pages/AcademicIntegrity.aspx

Standards of Conduct: http://conduct.ucr.edu/learnPolicies/Pages/StandardsofConduct.aspx