Build Business Smart.

How to Solve Problems with Simple Predictive Analytics

Predictive Analytics
Predictive analytics applies to a variety of business problems faced today, and more people are beginning to recognize its value. Businesses and nonprofits are using predictive analytics to answer real business questions like “What segment of potential donors will respond best to our message” and “Why am I losing customers, and how can I stop them from leaving?”

Even though the use of predictive analytics hold so much value for businesses and nonprofits, the general problem with implementing them is that the knowledge of how to do so is not readily available. Many people struggle when trying to make sense of good analysis practices, choosing appropriate predictive models for a given situation, and understanding the underlying statistics. To fill this gap of knowledge and provide an easy way to learn and take advantage of predictive analytics, Vault Analytics will be releasing a new book on August 2.

It contains detailed chapters describing how to do good analysis, how to choose an appropriate predictive model for your situation, and how to make sure the statistics powering the model are set up right. This is all done and explained in the familiar environment of Excel 2007, so that it can benefit those who may not have access to more advanced predictive analytical packages such as SAS and SPSS.

If you’d like to download the first few chapters for free, or pre-order the book, you can do so here.

Otherwise, below I’ve copied a section from the book that I think is extremely valuable for anyone new to data analysis. It describes two of the most important fundamentals: Seeing the data in context, and segmentation.

Seeing the Data in Context

Understanding what the data are telling you within the context of the business situation being analyzed is extremely important. This will help you avoid making faulty conclusions and keep your analysis appropriate for the business question being answered. The best way to learn this fundamental is to see it in action, so we will take an example.

We will look at a type of direct mail campaign analysis. We want to know how many calls are expected to come into our call center after we execute the campaign. First, we take some historical data showing us the percentage of total calls coming in according to the number of days after starting a mail campaign, shown below.

Data Table

After creating a scatter plot of the data, we try to fit a logarithmic regression line as a model, shown seen below.

logarithmic predictive model

Even though the R2 tells us that the fit is good, the model may not be the best way to explain this data when the context and purpose of this analysis are considered. We want the model to be able to predict what percentage of total calls will come in from a mailing campaign so we can staff the call center. If I were to use the line above as the model, I would be predicting low values for incoming calls between about day 20 and 100, and high values thereafter. Because of this error, we would not be staffing the call center correctly.

To create a better model, I would consider the fact that, in this context, it is not necessary to fit a trend model to the entire data set. Consider the following model, which can be used to predict the percentage of total calls coming in between days 4 and 35 after the mailing campaign:

analytical model

You will notice that this trend model does not contain the same high and low errors as the previous model did. Further, upon doing some calculations on the data in the spreadsheet, we know that anything before day 4 makes up for just 8% of all calls, and anything after day 35 makes up for just 15% of all calls. I have highlighted with a model the time period of the biggest growth to the call percentage, while summarizing the remaining percentages on either side. This will give just the right amount of information needed to staff the call center, while minimizing errors I would have made trying to fit a single trend model to the data.

The point here is to look at the data in the context of the purpose of the analysis. What are you going to use the predictive model for? Is it necessary to fit a model to the entire data set? How exact do you need to be with the prediction? What is the most important part of the data set to model? These and other questions are important to consider when performing analysis.

Segmentation

The second fundamental of analysis is the practice of segmenting the data. As with seeing the data in context, this is best described with an example. Consider the analysis presented below, which shows a linear regression model to predict how much someone will likely donate to your cause according to their age.

non segmented model

The fit of the model is extremely weak, and there seems to be no relationship between donation and age. However, this data was taken and aggregated from two different cities, Boston and New York. If we separate out the data according to those two cities (otherwise known as segmenting by them), we get the following when we run a regression analysis:
segmented model

By segmenting the data first, we notice that there is, in fact, a relationship between donation and age, but that relationship differs depending on what city you are in.

29 Comments

  1. Posted September 19, 2010 at | Permalink

    You post awsome articles, i have bookmarked for future referrence !

  2. Analytic Models
    Posted March 15, 2011 at | Permalink

    Analytic Models are mathematical models that have a closed form solution, i.e. the solution to the equations used to describe changes in a system can be expressed as a mathematical analytic function.

  3. Posted April 14, 2011 at | Permalink

    This is Awesome! Thank you so much.

  4. Posted April 17, 2011 at | Permalink

    Thank you for the good writeup. It in fact was a amusement account it. Look advanced to more added agreeable from you! By the way, how can we communicate?

  5. Posted April 18, 2011 at | Permalink

    So I sincerely declare you actually come up with many great points and I will publish a number of thoughts to add to shortly.

  6. Posted April 18, 2011 at | Permalink

    Great blog 9/10! Bookmarked :)

  7. Posted April 19, 2011 at | Permalink

    Really? It really is excellent to witness anyone finally begin addressing this stuff, however I’m still not really certain how much I agree with you on it all. I subscribed to your rss feed though and will certainly keep following your writing and possibly down the road I may chime in once again in much more detail. Cheers for blogging though!

  8. Posted April 20, 2011 at | Permalink

    F*ckin’ tremendous things here.I’m very glad to see your post.Thanks a lot and i am looking forward to contact you.Will you kindly drop me a mail?

  9. Posted April 20, 2011 at | Permalink

    Hello There. I found your blog using Twitter. This is a really well written article. I’ll be sure to bookmark it and return to read more of your useful information. Thanks for the post. I will definitely return.

  10. Posted April 21, 2011 at | Permalink

    Thanks, I liked this blog post. I found this site using AOL search, and certainly liked reading over it, so I’ll probably visit through again within a week and read up on what’s new :) Great Post!

  11. Posted April 22, 2011 at | Permalink

    Pretty interesting post. You have a interesting review on this matter and I’ll be subscribing to your RSS feed and will hope you will write frequently on similar matters. But I was curious on what your article sources for the post are? Thanks a lot

  12. Posted April 22, 2011 at | Permalink

    Very nice post. I just stumbled upon your blog and wished to say that I’ve really enjoyed surfing around your blog posts. After all I’ll be subscribing to your feed and I hope you write again very soon!

  13. Posted April 24, 2011 at | Permalink

    I love your blog.. very nice colors & theme. Did you design this website yourself or did you hire someone to do it for you? Plz respond as I’m looking to construct my own blog and would like to find out where u got this from. thank you

  14. Posted April 25, 2011 at | Permalink

    http://www.conveyancingquotes.info/?page_id=9

  15. Posted April 25, 2011 at | Permalink

    I view something genuinely special in this web site.

  16. Posted April 26, 2011 at | Permalink

    I’ve bookmarked http://vaultanalytics.com/marketinganalytics/2010/07/how-to-solve-problems-with-simple-predictive-analytics/ at Reddit.com so my friends can see it too. I used How to Solve Problems with Simple Predictive Analytics | Predictive Analytics Blog so it was a good title.

  17. Posted April 27, 2011 at | Permalink

    There is lots of good information on this site.

  18. Posted April 27, 2011 at | Permalink

    i cant truly assist with what you want but i seek a simular path and i do a simular deed.

  19. Posted April 28, 2011 at | Permalink

    Excellent beat ! I would like to apprentice while you amend your website, how can i subscribe for a blog web site? The account helped me a acceptable deal. I had been a little bit acquainted of this your broadcast offered bright clear idea

  20. Posted April 28, 2011 at | Permalink

    we really like this particular, where can I get extra info on this kind of subject?

  21. Posted April 28, 2011 at | Permalink

    Hi there just wanted to give you a quick heads up. The words in your post seem to be running off the screen in Ie. I’m not sure if this is a formatting issue or something to do with web browser compatibility but I thought I’d post to let you know. The layout look great though! Hope you get the problem resolved soon. Thanks

  22. Posted April 29, 2011 at | Permalink

    I loved reading this article I will make sure to tell my friends about this and link to it too. Thanks

  23. Posted April 30, 2011 at | Permalink

    link for you on my blog here http://tinyurl.com/page-id-9 ,I like this website so much, saved to favorites .

  24. Posted April 30, 2011 at | Permalink

    well… good luck discovering a free of charge minute. I hope you have better luck doing that than I do.

  25. Posted April 30, 2011 at | Permalink

    You actually theme likely have some dilemmas, i can’t often see your banner ad header.

  26. Posted April 30, 2011 at | Permalink

    I am glad to be a visitor of this stark weblog! , thanks for this rare info ! .

  27. Posted May 1, 2011 at | Permalink

    You completed various nice points there. I did a search on the subject and found a good number of folks will go along with with your blog.

  28. Posted May 1, 2011 at | Permalink

    Keep up the great piece of work, I read few blog posts on this site and I think that your site is rattling interesting and has circles of fantastic info . you have a link on my blog http://www.conveyancingquotes.info/?page_id=9

  29. Posted May 4, 2011 at | Permalink

    How can I locate out a lot more data on this matter?