First Presentation Final Year Project 2013 Analyzing Stock

  • Slides: 18
Download presentation
First Presentation, Final Year Project, 2013 Analyzing Stock Quotes using Data Mining Techniques Name

First Presentation, Final Year Project, 2013 Analyzing Stock Quotes using Data Mining Techniques Name of Student: To Yi Fun University Number: 2010149103

Flow of Presentation • • Aim of the this classification for stock trade Theory

Flow of Presentation • • Aim of the this classification for stock trade Theory of Classification Decision Tree making Introduction of the application Structure and techs used in this application Preparation Interface

Flow of Presentation • • Demonstration Data Analysis What to do next Q&A

Flow of Presentation • • Demonstration Data Analysis What to do next Q&A

Aim • Find a model for class attribute as a function of others to

Aim • Find a model for class attribute as a function of others to group a class for previously unseen records • e. g. find out the classifier for historic stock price; Group companies into different classes for inspection • classier: decision tree, rule-based classifier

Theory for Decision Tree • A series of test conditions making to sort the

Theory for Decision Tree • A series of test conditions making to sort the instances into class • Greedy, split record based on attribute that best suit the criterion • Attribute (discrete) setting, 2 -way split; multiple -way split

Theory for Decision Tree • Best split -Gini Index, generalization of variance impurity -Entropy,

Theory for Decision Tree • Best split -Gini Index, generalization of variance impurity -Entropy, amount of impurity on a set • Aim: using a training set to provide a classifier for classifying testing set

Application Structure Download CSV 2 MYSQ LGENERAT OR Processed Data Filter Query (Splitting) Raw

Application Structure Download CSV 2 MYSQ LGENERAT OR Processed Data Filter Query (Splitting) Raw data Data processing Information presentation and arithmetic operation

Preparation • Downloading the stock historic data: for 30 DOM shares e. g. Pfizer,

Preparation • Downloading the stock historic data: for 30 DOM shares e. g. Pfizer, Bank of America, America Express, Exxon • Convert to. csv file to be processed by the CSV 2 MYSQLGENERATOR program, the result is a lengthy sql commands

Data Processing • Categories into different type of stock by its industries • Dow

Data Processing • Categories into different type of stock by its industries • Dow 30 as training set and 8 more stocks as testing set, mainly large scale company

Data Processing • Downloading the stock historic data: for 30 DOM shares e. g.

Data Processing • Downloading the stock historic data: for 30 DOM shares e. g. Pfizer, Bank of America, America Express, Exxon • Convert to. csv file to be processed by the CSV 2 MYSQLGENERATOR program, the result is a lengthy sql commands

Data Processing • Attributes Setting -HL_30 Days. Average: Tendency -HL_Change. Daily: Change -HL_Change. Perc:

Data Processing • Attributes Setting -HL_30 Days. Average: Tendency -HL_Change. Daily: Change -HL_Change. Perc: Difference -HL_Vol. Change: Popularity Class: -B_Rise. More 3 Perc 5 Day: Buy Signal

Data Processing • Attributes Setting

Data Processing • Attributes Setting

User Interface • Make Use of the mysql connector to input the processed data

User Interface • Make Use of the mysql connector to input the processed data into the C# • Three Major Components: -Input -Result Log -Test

Demonstration • Make Use of the mysql connector to input the processed data into

Demonstration • Make Use of the mysql connector to input the processed data into the C# • Three Major Components: -Input -Result Log -Test

Result

Result

Result Analysis Attributes Setting -HL_30 Days. Average: Tendency -HL_Change. Daily: Change -HL_Change. Perc: Difference

Result Analysis Attributes Setting -HL_30 Days. Average: Tendency -HL_Change. Daily: Change -HL_Change. Perc: Difference -HL_Vol. Change: Popularity

What to do Next • Implement a more user friendly UI for presenting the

What to do Next • Implement a more user friendly UI for presenting the stock price, visualize the tree and provide query service • Implement an splitting Algorithm using Gini and compare the difference of the results generated by these Algorithms

Q&A

Q&A