COMPETENCIES
4034.3.1 : Dimension, Quality, Relations, and Limitations
The graduate examines the data available for analysis to determine their dimension, quality, relations, and limitations.
4034.3.2 : Physical Data Models
The graduate implements physical data models.
4034.3.3 : Table Operations
The graduate performs table operations and queries within the context of data acquisition for analysis.
INTRODUCTION
A common undertaking for a data analyst is to connect an external data collection to an existing data set. This process involves the exploration of the source and target data sets to merge the new data in a meaningful and logical way.
You will replicate this common process in this task. You will take external data from a comma-separated values (CSV) file and design conceptual, logical, and physical models that describe the data. You will load the new data into an existing database alongside the existing data and run queries across it.
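The load-and-query workflow described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the `customer` and `survey` tables, their columns, and the sample rows are hypothetical, and an in-memory SQLite database stands in for the lab database so the sketch runs on its own. The actual data sets, data dictionary, and database system are the ones provided at the Labs on Demand site.

```python
import csv
import io
import sqlite3

# Hypothetical stand-in for the add-on CSV file; the real file's
# columns and values are defined by the lab's data dictionary.
ADDON_CSV = """customer_id,survey_score
1,8
2,5
3,9
"""


def load_and_query():
    """Load CSV rows next to an existing table, then query across both."""
    conn = sqlite3.connect(":memory:")

    # Hypothetical "existing" table, standing in for the original database.
    conn.execute(
        "CREATE TABLE customer ("
        " customer_id INTEGER PRIMARY KEY,"
        " state TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO customer VALUES (?, ?)",
        [(1, "UT"), (2, "UT"), (3, "NV")],
    )

    # Physical model for the new data: explicit data types and a key
    # that relates each CSV row back to the existing table.
    conn.execute(
        "CREATE TABLE survey ("
        " customer_id INTEGER PRIMARY KEY REFERENCES customer(customer_id),"
        " survey_score INTEGER NOT NULL)"
    )

    # Read the CSV and convert each text field to its declared type.
    reader = csv.DictReader(io.StringIO(ADDON_CSV))
    rows = [(int(r["customer_id"]), int(r["survey_score"])) for r in reader]
    conn.executemany("INSERT INTO survey VALUES (?, ?)", rows)

    # A question that needs both sources:
    # what is the average survey score in each state?
    result = conn.execute(
        "SELECT c.state, AVG(s.survey_score)"
        " FROM customer AS c"
        " JOIN survey AS s ON s.customer_id = c.customer_id"
        " GROUP BY c.state"
        " ORDER BY c.state"
    ).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for state, avg_score in load_and_query():
        print(state, avg_score)
```

The join key is the design choice that matters most here: the new table is only useful if at least one of its columns relates to a column in the existing data, which is why the source and target data sets are explored before the merge.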
You will work on this performance assessment on the Labs on Demand website, which can be found in the Web Links section. The data sets, associated data dictionary, and CSV file are located at the Labs on Demand site.
REQUIREMENTS
Your submission must be your original work. No more than a combined total of 30% of the submission and no more than a 10% match to any one individual source can be directly quoted or closely paraphrased from sources, even if cited correctly. The similarity report that is provided when you submit your task can be used as a guide.
You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.
Tasks may not be submitted as cloud links, such as links to Google Docs, Google Slides, OneDrive, etc., unless specified in the task requirements. All other submissions must be file types that are uploaded and submitted as attachments (e.g., .docx, .pdf, .ppt).
Perform the following tasks to combine an existing data set with some external data:
A. Provide a question that can be answered using structured query language (SQL) to acquire data from both the original database and the add-on CSV file data. The question should require data from both data sources.
1. Identify which data from the original data set and the add-on CSV file are needed to answer the question, including all tables, columns, and data types.