Those of you who took statistics in college probably remember the class with varying degrees of fondness. Personally, I loved my statistics class. I realized at the time that multiple linear regression was the perfect tool to quantify the effect of various property characteristics on the overall value. The theory of statistics is the mathematical basis for what appraisers traditionally call “paired sales analysis”. Given two houses which are identical in all respects except one, the difference in price should be explainable by that single differing characteristic. For example, two nearly identical houses on the same street sold on the same day. The one with a two car garage sold for $10,000 more than the one without. One can infer that the amount contributed by a two car garage is $10,000.
There’s only one problem with that methodology. The sample size from which the conclusion is drawn is too small. A conclusion is drawn from a sample size of two is hardly large enough to draw statistically significant conclusions about a larger population of sales. Most appraisal reports present very few sales from which to conclude an opinion of value. Data only tends to take a normal shape with a sample size of 30. Early on in my career I realized the quandary we appraisers were in. We are tasked with presenting a defensible appraisal report while severely handicapped in the volume of data available for analysis. Oh how things have changed.
It is now possible to download and analyze hundreds…even thousands of commercial or residential real estate sales into a single file with multiple linear regression. I won’t bore you with the technical details but what multiple linear regression allows is to derive an equation used to estimate the value of real estate. Suppose, for example, that you have evaluated which property characteristics significantly contribute to the value of a single family home. You may find that the gross living area has the strongest correlation to value followed by the number of bathrooms and whether the home has a garage or a basement. I ran just this sort of analysis on homes around Elizabethtown, Kentucky, which resulted in the following regression equation:
Y (Predicted Price) = -9638 + (GLA * 74.47) + (AGE*-800) + (BED * 200) + (BATH * 7,802) +(BSM * 5,898) + (ACRES * 4,792) + (GAR * 22,218) + (FIN BSM SF * -15)
Notice that age has a negative correlation with the predicted price, as one might expect. What surprised me was the low contribution for each bedroom. Another interesting aspect of this equation is that, while the presence of a basement has a positive correlation, finished basement space has a negative correlation. A logical conclusion is that basement space contributes positively though that positive contribution is negated if you finish too much of that space. In this case, the point at which the finished basement space ceases to make a positive contribution may be calculated as 5,898/15 = 393.2
Another cool aspect of being able to download masses of data is that it allows an appraiser to graphically depict various relationships. For example, I’m using the chart below in an appraisal to demonstrate the relationship between the size and price of a site.
As shown, the price per square foot that a purchaser is willing to pay tends to decrease as the size of the site increases. We’ve know this for years but to be able to graph and quantify the effect, well, that’s pretty exciting stuff if you’re an appraiser. Anyhow, I just wanted to update my blog and let you guys know what’s going on. I’ll post more later.
If you have any other questions about commercial appraisals, contact us here!
Wyatt Roberts says
Great post! I wish more appraisers understood the power of statistics. So-called “paired sales analysis” is a joke. Thank you for this excellent article. (And BTW, I suspect the reason bedrooms have such little influence in your equation is, primarily, because of GLA, and also because very few buyers desire homes that have less or more than what is typical.)
RussellRoberts says
Thanks, Wyatt. You inspire me in my appraisal work. I’m just trying to emulate the master appraiser!
Wyatt Roberts says
You, my brother, are my hero! And look at that. 21 Likes!
RussellRoberts says
What the heck? Did you send a bunch of people to like this article?
richard pfahl says
Which program now on the market do you feel is the most practical?
Russell Roberts says
Richard, I haven’t checked out the programs that do this automatically. I only do commercial work and use Excel. If you need any help, let me know.
James Deppe says
Real Estate sales per SF usually follow a exponential curve per research articles. The only time you can use a straight line for plotting is if the size does not very significantly.
James Deppe says
Real Estate sales per SF usually follow a exponential curve per research articles. The only time you can use a straight line for plotting is if the size does not very significantly.
For a large amount of real estate data, for example house sales, is to use a closed end modeling program using an equation weighted with predictor variables determined by multiple regression.
Jon says
One flaw is that your model is actually not linear, you can tell by the way the data is distributed. You should not be running a straight line through it but a different equation that fits the shape of the model to get a better expectation of the data. Also some of the other variables you mention do not have much additional explanatory power (such as bedrooms because they are highly correlated to sqft thus you are measuring with the same thing twice).
Pokemon Quest Online Generator says
Hi there, after reading this remarkable paragraph i am too delighted to share my knowledge here with
colleagues.
hoe kan ik kijken Privé instagramafbeeldingen says
I think this is one of the such a lot important info for me.
And i’m happy reading your article. However wanna commentary on some general things, The website
style is wonderful, the articles is truly great : D. Excellent process, cheers