CIS 4400 Data Warehousing for Analytics Spring 2020
- Slides: 15
CIS 4400 - Data Warehousing for Analytics Spring 2020 Week 1
This class has 2 professors! • B. Madhusudan [that’s me!] • Royce Kok [will be here starting next week] 2 Information Systems Program
About Me • Industry Professional with 20+ years of experience – – Been at big companies Been at startups Been on both coasts Multiple roles as Software Engineer, QA Lead, Business Analyst, Data Analyst, Project Manager, Program Manager, Business Manager, Dev/Team Lead, Scrummaster • You get the idea! – Fun Fact: I was on TV Once! 3 Information Systems Program
Now You! • • • Name Major Expected Graduation Date Job (if currently working) Fun Fact! 4 Information Systems Program
Syllabus • Let’s go over it! • http: //www. faculty. baruch. cuny. edu/bmadhusudan 5 Information Systems Program
Syllabus Grading breakdown: Midterm Exam 15% Final Exam 20% Programming Assignments 30% Data Warehouse Project 25% Class Participation (Quizzes) 10% 6 Information Systems Program
Key Points from Syllabus • Cheating will be dealt with harshly • Show up for class – If you cannot make it, let me know ahead of time; Just like work! – Otherwise you get a 0 on the quiz for that day • This is a DOING class; You will not pass by just reading. You need to be on the computer and do the assignments and projects – Start EARLY! • Remember: You have a 100% RIGHT NOW! 7 Information Systems Program
Coming up! • HW #0 -> Surveys; Due Thursday • HW #1 -> Coming next Tuesday 8 Information Systems Program
Pre-requisites • CIS 3400 or CIS 9340 (or equivalent): we will assume knowledge of in this class • This stuff just works and is used behind the scenes. • E. g. – – SQL E-R Diagrams, Normalization Database Triggers, Database Indexes Database Transactions / Concurrency : ACID, 2 PL • We will refer to as needed 9 Information Systems Program
Why should you learn DW? • The original Big Data! • SQL is still KING [If you are rusty, I strongly advise you to take a free SQL refresher class as advised in the e-mail I sent out!] • Great knowledge-base to have if you work in data: Data Science, Data Analyst, Business Analyst, Data Modeler, etc. • Industry is coming back to a lot of these practices • Lots of companies in this space! – Atscale. com – Snowflake. com – Memsql. com • Lots of initiatives: – Major elections in 2020 (Congress/Presidential/State/Local) – US Census 2020 is coming! – Business 10 Information Systems Program
Typical Analytical Questions (KPIs or Key Performance Indicators) • Which marketing media (print, web sites, email lists, etc. ) are underperforming and need to be dropped? • Which products are most often abandoned in the customer’s shopping cart? • What intersection has the most deadly accidents? • Which applicants are most likely to graduate if we give them a scholarship? • What factors indicate students at risk of dropping out? • Is there a relationship between ____ and ____? Fill in the blanks. 11 Information Systems Program
Why Data Warehousing? Source systems an organization is likely to have • • Human Resources General Ledger/Accounts Payable, Accounts Receivable Budgeting Help Desk Building Security Asset Management Contact Relationship Management Specialty: Manufacturing/Library/Clinical Care/Fundraising/Sales/etc. 12 Information Systems Program
Why Data Warehousing? What’s wrong with what we already have? Historically, systems were just not built that way. • It is difficult or impossible to answer analytical questions because the data are not available in a meaningful way. • If we get an answer, we often cannot get it again because the tools change and the data change. • There are multiple meanings for many data items with business departments disagreeing about them. • Because there is no central trusted mechanism, there are many answers generated that are not consistent. • A huge amount of employee time is spent putting together data into reports and much of the effort is duplicated across the organization. • There is no good way to store results to do trend analytics. Information Systems Program 13
Why Data Warehousing? The Goals of this Class • To understand the best practice structure of a data warehouse system. • To understand the best practice solution to these issues: to get data that are clean, consistent, current, conformed, comprehensive [5 Cs] available, and easy to use. • To understand the choices in end-user analytical tools. • To learn about the business processes involved, such as proposing a solution and gathering requirements. • To have hands-on experience building a data warehouse solution incorporating all of the above. 14 Information Systems Program
Announcement Program benefits include: • Employer Site Visits • Employer-Site Professional Development Workshops • Tech Workshops • Networking & Mentoring Opportunities • Group Tech Internship • Resume Building 15 Information Systems Program
- Cis 3400
- Upenn mse in data science
- Data mining in data warehouse
- Hive
- Mining fraud
- Data warehousing olap and data mining
- Data warehousing best practices
- Introduction to data warehouse
- "amplitude" analytics or "product analytics"
- Isrs 4400 revised
- 4400 standard
- Cisco 4400
- 80 sayısının yüzde 20 fazlası kaçtır
- Karanovo
- Four seasons korean movie
- Winter spring summer or fall