MA 235 - Introduction to Data Science Description Introduction to the main tools of data science. Such as probability, statistical inference, regression, and machine learning. Introduction to analyzing data and turning data into decision making tools. Topics include collecting, wrangling, and sharing data, and how to communicate results through graphical visualizations.
The course will prepare students to learn the more advanced concepts and skills of data science.
Statistical software will be used. No previous knowledge of a Statistics software package or programing is necessary. (Some experience with programming is recommended.) Credit Hours: 4 Contact Hours: 4 Prerequisites/Other Requirements: None English Prerequisite(s): None Math Prerequisite(s): None Course Corequisite(s): MA 215 (C or Higher) – Must be taken either prior to or at the same time as this course. Academic Program Prerequisite: None Consent to Enroll in Course: No Department Consent Required Dual Enrollment Allowed?: Yes Course Fees: $19.00 Number of Times Course can be taken for credit: 1 Programs Where This Course is a Requirement: Data Science, Certificate General Education Requirement: None General Education Learner Outcomes (GELO): None Course Learning Outcomes: 1. Use technology to compute statistical measurements from data sets, then interpret the results.
2. Utilizing technology to create visual representations of statistical data.
3. Create and interpret hypothesis tests for population parameters.
4. Acquire and clean their own data, combining data from multiple sources, and data wrangle.
5. Demonstrate the use of complex models and begin to construct models of their own.
6. Formulate simple algorithms to solve problems, and can code them in a high-level language appropriate for data science work.
7. Create a written, structured presentation, or report consistent with Statistical literature. Course Outline: I. R Basics
I.A. Objects
I.B. Functions
1C. Scripts
II. Data
II.A. Tidy Data
II.B. Summarizing
II.C. Data Frames
III. Data Visualization
III.A. Plots
III.B. Robust Visualization
IV. Statistics
IV.A. Probability Distributions
IV.B. Random Variables
IV.C. Central Limit Theorem
V. Statistical Inference
V.A. Sampling methods
V.B. Confidence Intervals
V.C. p-Values
VI. Regression
VI.A. Simple Regression
VI.B. Multiple Regression
VI.C. Least Squares Estimates
VII. Data Wrangling
VII.A. Reshaping
VII.B Web Scraping
VII.C. Text Mining
VIII. Machine Learning
VIII.A. Smoothing
VIII.B. Predict Function
VIII.C. Regression
VIII.D. Nearest Neighbor
IX. Big Data Approved for Online and Hybrid Delivery?: Yes Instructional Strategies: Lecture: 0-90%
Applied work: 20-70%
Group work: 0-30%
Facilitated discussion: 0-20% Mandatory Course Components: Students will obtain a solid introduction to R as a functional programming language and will be able to use R to effectively compute statistical and graphical procedures. Equivalent Courses: None Name of Industry Recognize Credentials: None
Course-Specific Placement Test: None Course Aligned with ARW/IRW Pairing: NA Mandatory Department Assessment Measures: None Course Type: Program Requirement- Offering designed to meet the learning needs of students in a specific GRCC program. Course Format: Lecture - 1:1 Total Lecture Hours Per Week: 4 People Soft Course ID Number: 105131 Course CIP Code: 27.01 Maximum Course Enrollment: 25 General Room Request: Computer Lab, (or computer laptop cart.) High School Articulation Agreements exist?: No Non-Credit GRCC Articulation Agreement With What Area: No School: School of STEM Department: Mathematics Discipline: MA First Term Valid: Fall 2022 (8/1/2022) 1st Catalog Year: 2022-2023 Name of Course Author: Brian Hadley Faculty Credential Requirements: 18 graduate credit hours in discipline being taught (HLC Requirement), Master’s Degree (GRCC general requirement), Other (list below) Faculty Credential Requirement Details: Master’s Degree in Mathematics, or in a closely related field with at least 18 semester hours of graduate work in mathematics. A background in Statistics is required. Faculty must be comfortable with a Statistical programming environment. Course Review & Revision Year: 2026-2027
Add to Catalog (opens a new window)
|