Big Data Mapreduce Course Save

Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University

Project README

Spring Quarter 2024

Big Data Modeling & Analytics

Santa Clara University



1. Course Information:

This course is about  big data and its role in carrying 
out modern business intelligence for actionable insight 
to address new business needs. This course is a lab-led 
and open source software rooted course.  Students  will 
learn the fundamentals  of MapReduce, Spark  framework,
NoSQL databases, PySpark, and Amazon Athena.  The class 
will  focus on  the storage,  processing, and  analysis 
aspects of big data.  Students  will use  Spark cluster 
and  MapReduce fundamentals to solve big data problems.

2. Class Meeting Dates & Hours

  • Class meeting dates:
    • Start: April 2, 2024
    • End: June 6, 2024
    • Final Exam week: June 10-13, 2024
  • Class hours:
Day Start time End time
Monday 5:45 PM PST 7:45 PM PST
Wednesday 5:45 PM PST 7:45 PM PST

3. Instructor, Adjunct Professor: Mahmoud Parsian

4. Prerequisite

5. Course Description & Concepts

6. Big Data Modeling Class Web Site

7. Glossary of Big Data, MapReduce, Spark

8. Required Books and Papers

9. Optional Books and References

10. Required Software: MapReduce & Spark/PySpark

11. Syllabus, Spring Quarter 2024

12. Grading and Class Conduct

13. Python Tutorials

14. SQL Tutorials

15. MapReduce Tutorials

16. PySpark Tutorials

17. Office Hours

18. Midterm Exam

19. Final Exam

20. Mahmoud Parsian's Latest Books:


Data Algorithms with Spark

Data Algorithms with Spark

PySpark Algorithms

PySpark Algorithms Book

Data Algorithms

Data Algorithms Book
Open Source Agenda is not affiliated with "Big Data Mapreduce Course" Project. README Source: mahmoudparsian/big-data-mapreduce-course