Using NeuralTools to generate a pricing model for wool Kimbal Curtis and John Stanton Australian Wool Industry 70% of world trade in apparel wool is Australian wool Unlike other commodities • Each farm lot is fully measured • Each farm lot has an individual price About 450,000 farm lots sold each year in Australia Raw wool value of AUD3 billion annually Wool prices & market reporting Estimates of auction price on individual lots needed by sellers (farmers) Forecast auction price on individual lots required by buyers for contracts Market reporting of price paid for different wool types Neural nets & wool prices Neural nets attractive because • Number of records is large • Prices are dynamic • Price/attribute relationships are non-linear with interactions • Price/attribute relationships change over time • The data set is incomplete and imprecise All Merino fleece lots (Fremantle Jan-Mar 2006) Each grey dot represents a parcel of wool sold at auction i.e. a ‘case’ Long & short fleece lots (Fremantle Jan-Mar 2006) Long and short wool differentiated on price Merino pieces lots (Fremantle Jan-Mar 2006) Pieces wool (a subset of the wool clip) Changes to price diameter relationship (September) 2001 2003 2005 2007 The Challenge ! Market Indicators (Fremantle Jan-Mar 2006) Market indicators, like a stock market index, used to price wool Model development Stages 1. Assemble 6 month data set 2. Use Best Net Search 3. Evaluate predictive capability 4. Refine model Model development (1) Assemble 6 month data set Independent category and numeric variables Dependent numeric variable (price) Training, testing and prediction data Use Best Net Search Evaluate predictive capability Refine model Model development (2) Assemble a 6 month data set Use Best Net Search GRNN – proved best in most cases (generalised regression neural net) MLFN – also tried with up to 5 nodes (multi layer feed-forward neural net) Evaluate predictive capability Refine model Configuration summary Net Information Name Configurations Included in Search Best Configuration Location Independent Category Variables Independent Numeric Variables Dependent Variable Net Trained on Pieces wool sales, weeks 33 38, 2006 (3) GRNN, MLFN 2 to 3 nodes GRNN Numeric Predictor Palisade Conf Curtis v6 BNS 6hrs.xls 8 (Sale centre, Sale week, Sale outcome, Style, Med Hard Cotts, Unscourable Colour, Jowls, Dark Stain) 8 (Staple Length, Staple Strength, Vegetable Matter, Diameter, CV Diameter, Mid Breaks, Yield, Hauteur) Numeric Var. (Clean price) Model development (3) Assemble a 6 month data set Use Best Net Search Evaluate predictive capability Refine model Model evaluation (1) NeuralTools outputs Error measures Actual versus Predicted, Residuals Variable Impact Analysis Live Prediction Relationships between variables Compare to published market indicators Model evaluation (1) Training and Testing summary Training Number of Cases Training Time (h:min:sec) Number of Trials Reason Stopped % Bad Predictions (5% Tolerance) Root Mean Square Error Mean Absolute Error Std. Deviation of Abs. Error 5910 0:39:43 104 Auto-Stopped 14.7377% 24.72 16.42 18.48 Testing Number of Cases % Bad Predictions (5% Tolerance) Root Mean Square Error Mean Absolute Error Std. Deviation of Abs. Error 1507 43.3975% 53.18 36.99 38.21 Model evaluation - Training data (mean absolute error 16 cents) Model evaluation - Testing data (mean absolute error 37 cents) Model evaluation (1) Testing data (indicators) Observed versus predicted for the published Pieces Market indicators Most points are on the 1:1 line, but a small group hover above i.e. they have higher predicted values than reported Model evaluation (1) Variable impact analysis Relative Variable Impacts 0% Diameter Vegetable Matter Staple Length Jowls Hauteur Sale outcome Med Hard Cotts Yield CV Diameter Staple Strength Sale centre Sale week Dark Stain Style Unscourable Colour Mid Breaks 10% 20% 30% 40% 50% 60% 70% 41.3% 18.7% 11.7% 8.8% 7.7% 1.9% 1.8% 1.6% 1.2% 1.2% 1.1% This is a sensitivity analysis, 0.9% 0.7% not the percent of variance 0.6% accounted for by each variable 0.4% 0.4% Model evaluation (2) NeuralTools outputs Live Prediction Relationships between variables Compare to published market indicators • Error measures • Actual versus Predicted, Residuals • Variable Impact Analysis Model evaluation (2) Live prediction Simple spreadsheet pricing tool. Sale centre Sale week Style Change any of the values in the yellow cells, and ‘Live prediction’ updates the clean price Med Hard Cotts Unscourable Colour Jowls Dark Stain Fremantle W38 Average C0 H0 J0 S0 Diameter 20.0 Yield Vegetable Matter 50.0 2.5 Staple Length Staple Strength Mid Breaks Hauteur Clean price 80 35 55 62 664 Model evaluation (3) NeuralTools outputs • Error measures • Actual versus Predicted, Residuals • Variable Impact Analysis Live Prediction Relationships between variables Compare to published market indicators Model evaluation (3) relationships between variables 680 670 660 650 Clean 640 Price 630 620 610 600 590 Sydney Week 38 21 micron 2% VM 45 85 80 Staple Length 40 75 35 70 30 65 25 Staple Strength Model evaluation (3) relationships between variables 21 micron Fremantle Melbourne 635 680 680 630 670 670 625 660 660 650 620 650 Cl ean P r i ce Cl ean 615 P r i ce Cl ean 640 P r i ce 630 630 605 620 610 600 610 600 595 600 45 85 80 620 590 80 St a p l e 35 70 45 85 40 75 Lengt h St r e n g t h 30 80 St a p l e 35 70 Lengt h 65 45 85 40 75 St a p l e St r e n g t h 30 40 75 St a p l e 70 25 650 640 640 630 630 620 600 580 610 620 Cl ean Cl ean 560 P r i ce St r e n g t h 30 65 25 620 St a p l e 35 Lengt h 65 25 P r i ce 640 610 St a p l e 22 micron Sydney 610 Cl ean P r i ce 600 600 590 580 540 590 570 580 560 520 570 500 550 560 45 85 80 St a p l e Lengt h 540 80 40 75 35 70 30 65 25 45 85 St a p l e St r e n g t h St a p l e Lengt h 35 70 30 65 25 45 85 80 40 75 St a p l e St r e n g t h St a p l e Lengt h 40 75 35 70 30 65 25 St a p l e St r e n g t h Model evaluation (3) relationships between variables 19 micron Fremantle Melbourne 760 840 820 755 830 810 750 820 800 745 810 740 Cl ean P r i ce 790 735 Cl ean 730 P r i ce 725 800 Cl ean P r i ce 790 780 770 780 720 770 760 710 760 750 705 750 715 45 85 80 740 80 St a p l e 35 70 Lengt h 45 85 40 75 St a p l e St r e n g t h 30 80 St a p l e 35 70 Lengt h 65 45 85 40 75 St a p l e St r e n g t h 30 40 75 St a p l e 70 740 670 730 665 720 St r e n g t h 30 65 25 675 St a p l e 35 Lengt h 65 25 20 micron Sydney 25 720 710 700 Cl ean 660 Cl ean P r i ce 710 Cl ean P r i ce P r i ce 655 700 650 690 645 680 640 670 690 680 45 85 80 St a p l e Lengt h 670 660 80 40 75 35 70 30 65 25 45 85 St a p l e St r e n g t h St a p l e Lengt h 35 70 30 65 25 45 85 80 40 75 St a p l e St r e n g t h St a p l e Lengt h 40 75 35 70 30 65 25 St a p l e St r e n g t h Price spread variation Model evaluation (4) NeuralTools outputs • Error measures • Actual versus Predicted, Residuals • Variable Impact Analysis Live Prediction Relationships between variables Compare to published market indicators Model evaluation (4) predictive capability 20 micron indicator 22 micron indicator Melbourne Week 38 Model evaluation (4) predictive capability Melbourne Week 38 Dark blue lots have SL, SS and VM “similar” to market indicator definition Model evaluation (4) predictive capability Melbourne Week 37 Model evaluation (4) predictive capability Melbourne Week 37 Model evaluation (4) predictive capability Melbourne Week 36 Model evaluation (4) predictive capability Melbourne Week 35 Model evaluation (4) predictive capability Melbourne Week 34 Model evaluation (4) predictive capability Melbourne Week 33 Model evaluation (4) predictive capability Fremantle Week 37 Model evaluation (4) predictive capability Fremantle Week 38 Model development (4) Assemble a 6 month data set Use Best Net Search Evaluate predictive capability Refine model • Reduce variables • Combine selling centres • Sale week - category variable Some Neural Net applications Market reporting Price predictor Validation check for other estimates Missing sale problem Generate price matrices Estimate premiums and discounts Premium for “organic” wool 1400 Predicted price Predicted price 1400 1200 1000 800 800 1000 1200 Actual price April sale 1400 1200 1000 800 800 1000 1200 Actual price June-July sale 1400 Summary Data rich application with characteristics that looked ideal for NeuralTools Solutions generated which can support industry analysis and generation of indicators