CIS 4400 Data Warehousing for Analytics Spring 2020

  • Slides: 15
Download presentation
CIS 4400 - Data Warehousing for Analytics Spring 2020 Week 1

CIS 4400 - Data Warehousing for Analytics Spring 2020 Week 1

This class has 2 professors! • B. Madhusudan [that’s me!] • Royce Kok [will

This class has 2 professors! • B. Madhusudan [that’s me!] • Royce Kok [will be here starting next week] 2 Information Systems Program

About Me • Industry Professional with 20+ years of experience – – Been at

About Me • Industry Professional with 20+ years of experience – – Been at big companies Been at startups Been on both coasts Multiple roles as Software Engineer, QA Lead, Business Analyst, Data Analyst, Project Manager, Program Manager, Business Manager, Dev/Team Lead, Scrummaster • You get the idea! – Fun Fact: I was on TV Once! 3 Information Systems Program

Now You! • • • Name Major Expected Graduation Date Job (if currently working)

Now You! • • • Name Major Expected Graduation Date Job (if currently working) Fun Fact! 4 Information Systems Program

Syllabus • Let’s go over it! • http: //www. faculty. baruch. cuny. edu/bmadhusudan 5

Syllabus • Let’s go over it! • http: //www. faculty. baruch. cuny. edu/bmadhusudan 5 Information Systems Program

Syllabus Grading breakdown: Midterm Exam 15% Final Exam 20% Programming Assignments 30% Data Warehouse

Syllabus Grading breakdown: Midterm Exam 15% Final Exam 20% Programming Assignments 30% Data Warehouse Project 25% Class Participation (Quizzes) 10% 6 Information Systems Program

Key Points from Syllabus • Cheating will be dealt with harshly • Show up

Key Points from Syllabus • Cheating will be dealt with harshly • Show up for class – If you cannot make it, let me know ahead of time; Just like work! – Otherwise you get a 0 on the quiz for that day • This is a DOING class; You will not pass by just reading. You need to be on the computer and do the assignments and projects – Start EARLY! • Remember: You have a 100% RIGHT NOW! 7 Information Systems Program

Coming up! • HW #0 -> Surveys; Due Thursday • HW #1 -> Coming

Coming up! • HW #0 -> Surveys; Due Thursday • HW #1 -> Coming next Tuesday 8 Information Systems Program

Pre-requisites • CIS 3400 or CIS 9340 (or equivalent): we will assume knowledge of

Pre-requisites • CIS 3400 or CIS 9340 (or equivalent): we will assume knowledge of in this class • This stuff just works and is used behind the scenes. • E. g. – – SQL E-R Diagrams, Normalization Database Triggers, Database Indexes Database Transactions / Concurrency : ACID, 2 PL • We will refer to as needed 9 Information Systems Program

Why should you learn DW? • The original Big Data! • SQL is still

Why should you learn DW? • The original Big Data! • SQL is still KING [If you are rusty, I strongly advise you to take a free SQL refresher class as advised in the e-mail I sent out!] • Great knowledge-base to have if you work in data: Data Science, Data Analyst, Business Analyst, Data Modeler, etc. • Industry is coming back to a lot of these practices • Lots of companies in this space! – Atscale. com – Snowflake. com – Memsql. com • Lots of initiatives: – Major elections in 2020 (Congress/Presidential/State/Local) – US Census 2020 is coming! – Business 10 Information Systems Program

Typical Analytical Questions (KPIs or Key Performance Indicators) • Which marketing media (print, web

Typical Analytical Questions (KPIs or Key Performance Indicators) • Which marketing media (print, web sites, email lists, etc. ) are underperforming and need to be dropped? • Which products are most often abandoned in the customer’s shopping cart? • What intersection has the most deadly accidents? • Which applicants are most likely to graduate if we give them a scholarship? • What factors indicate students at risk of dropping out? • Is there a relationship between ____ and ____? Fill in the blanks. 11 Information Systems Program

Why Data Warehousing? Source systems an organization is likely to have • • Human

Why Data Warehousing? Source systems an organization is likely to have • • Human Resources General Ledger/Accounts Payable, Accounts Receivable Budgeting Help Desk Building Security Asset Management Contact Relationship Management Specialty: Manufacturing/Library/Clinical Care/Fundraising/Sales/etc. 12 Information Systems Program

Why Data Warehousing? What’s wrong with what we already have? Historically, systems were just

Why Data Warehousing? What’s wrong with what we already have? Historically, systems were just not built that way. • It is difficult or impossible to answer analytical questions because the data are not available in a meaningful way. • If we get an answer, we often cannot get it again because the tools change and the data change. • There are multiple meanings for many data items with business departments disagreeing about them. • Because there is no central trusted mechanism, there are many answers generated that are not consistent. • A huge amount of employee time is spent putting together data into reports and much of the effort is duplicated across the organization. • There is no good way to store results to do trend analytics. Information Systems Program 13

Why Data Warehousing? The Goals of this Class • To understand the best practice

Why Data Warehousing? The Goals of this Class • To understand the best practice structure of a data warehouse system. • To understand the best practice solution to these issues: to get data that are clean, consistent, current, conformed, comprehensive [5 Cs] available, and easy to use. • To understand the choices in end-user analytical tools. • To learn about the business processes involved, such as proposing a solution and gathering requirements. • To have hands-on experience building a data warehouse solution incorporating all of the above. 14 Information Systems Program

Announcement Program benefits include: • Employer Site Visits • Employer-Site Professional Development Workshops •

Announcement Program benefits include: • Employer Site Visits • Employer-Site Professional Development Workshops • Tech Workshops • Networking & Mentoring Opportunities • Group Tech Internship • Resume Building 15 Information Systems Program