Assignments

Assignments in this course are individual tasks designed to enhance student learning and provide opportunities for independent practice. Detailed instructions for completing each assignment, along with submission guidelines, will be made available on the class website.

It is essential for students to adhere strictly to these submission instructions, as assignments that fail to comply or are submitted past the deadline will not be considered for grading.

This policy ensures fairness and consistency in the evaluation process while emphasizing the importance of personal responsibility in meeting academic expectations.


Due DateAssignmentStatus
2025-09-10Tutor Design IdeasPosted
2025-10-15Apache Spark ProjectPosted
2025-11-12Computer Vision ProjectPosted
2025-03-31AI Tutor ProjectNot Posted

Please review instructions for completing and submitting each of the assigments.

Apache Spark Project

Due: October 15, 2025 (Posted)
For your Spark programming project, you will explore a large, real-world public dataset using PySpark and transform complex data into actionable insights, culminating in professional visualizations. The goal is to simulate a practical data analytics workflow akin to those found in industry: you will acquire and preprocess the data at scale using Spark, define and compute three significant business or analytical insights, and present each insight through well-crafted static plots. This project challenges you not only to harness distributed computing for big data processing, but also to bridge the gap between large-scale computation and clear, impactful communication of findings, preparing your analyses as compelling visual reports ready for BI or decision-making audiences. Through this assignment, you’ll develop the core skills of data wrangling, scalable analysis, and effective storytelling with data—making your technical results accessible and persuasive to both technical and non-technical stakeholders.

Computer Vision Project

Due: November 12, 2025 (Posted)
This programming project challenges to develop a comprehensive computer vision system for analyzing urban traffic signs from real-world street scene imagery. Students will apply classical computer vision techniques, feature extraction methods, traditional machine learning classifiers, and modern deep learning approaches to build a robust traffic sign detection and recognition pipeline. The project uses the Mapillary Traffic Sign Dataset (https://www.mapillary.com/dataset/trafficsign) , which contains over 100,000 high-resolution images from diverse global locations with varying weather conditions, lighting, viewpoints, and camera sensors—making it an ideal testbed for advanced computer vision methods.