Addressing the Business Question (SQL Analysis)
Congrats on becoming a business analyst! Your database has been designed based on your requirements. Now it’s time to answer your business question:
Are wedding vendors with sustainable practices more cost effective?
Analysis Requirements (Jupyter Notebook)
Introduce the problem and define key terms
5-10 sentences
At least one credible source for each key term defined
Answer the business question
5-10 sentences
Make sure your results are statistically significant
Provide your top two actionable insights
5-10 sentences each
Provide at least one credible source per insight.
Make sure to go beyond the numbers. Note that the company is likely to already be taking advantage of common metrics such as correlations and is expecting a deeper level of analysis.
Use markdown to explain the rest of your analysis
250-450 words
Remember that markdown is used to explain what you, the analyst, has found important through the code. Code comments are used to explain the technical aspects of the code.
SQL Requirements
Provide the SQL queries needed to:
explore the data leading up to the creation of your final dataset
develop your final dataset (this is what will be exported into Excel and then read into Python)
Make sure to include a USE statement and ample comments throughout your code.
Do not use AI to generate any of your SQL code.
Python Requirements
Your code must generate the following:
Descriptive statistics
Frequency tables
Correlation
3-5 well-designed, highly relevant data visualizations (scatterplots, boxplots, etc.)
Make sure to avoid data dumping:
Remove any outputs/visuals that do not directly support your insights
Limit your tabular outputs
Do not use AI to generate any of your Python code.
Tips
To get your final dataset from SQL to Python, you may export the data from SQL into an Excel file and then imported into Python with pd.read_excel().
Avoid writing about what you did. Your stakeholders will assume that you took proper steps to analyze the data and do not have the bandwidth to read through your process. They are more interested in your answer to the business question, as well as your top two actionable insights.
Note that your stakeholders will start asking questions about the validity of your results if your insights stray from the SQL queries/Python code you provide.
Additional files (Excel, etc.) will not be assessed.
Deliverables
1. Submit a Jupyter Notebook in the following two formats:
Jupyter Notebook (.ipynb format)
HTML page, converted directly from the Jupyter Notebook interface (.html format)
2. Submit your SQL queries in the following two formats:
SQL script (.sql format)
Text file (.txt format)
Weighting
This assignment is worth 60% of your total grade for this course.