Course materials and documentation for DS2002
Choose a name for your team.
Create a public GitHub repository for your project
a. Add README.md. For now just add a header with the title. For Milestone, see How to write a README.
b. Add a LICENSE file (for example LICENSE or LICENSE.md). To make software meaningfully open source, your GitHub repository should include a license that describes permissible use. I recommend MIT, but see GitHub’s guide to licensing a repository if you want other options.
Write a design plan:
Review the high-level schematic of your chosen project (see project subfolders). Create a PDF file with the following sections:
Team name:
Team members:
Project: A/B/C
a. Data Structures
Example SQL: | Field | Type | Example | | — | — | — | | Status | VARCHAR(25) | “SUCCESS” | | Created | DATETIME | 2026-04-10 14:31:19 | | … | … | … | … |
Think about appropriate normalization. If you’re setting up multiple tables, describe each separately.
Example NoSQL:
{
"Status": "OK",
"Created": "2026-04-10T14:31:19Z",
...
}
If you’re setting up multiple collections, describe each separately.
b. Describe which specific systems will be used at each step of your pipeline:
Add your design plan PDF to your GitHub repository.
Submit your plan
Submit the link to your project GitHub repo in Canvas.
Review approach with instructor
We will check in with each group in class to confirm your approach and answer questions about your design choices. The date will be shared in class and on Canvas.