# Regression Basics For Business Analysis

The intercept has no meaning for the model, as the purpose of regression analysis is to evaluate the relationship between the predictor and the target. However, the intercept is vital for calculating predictive values. It is a statistical equation that best fits a set of observations of dependent and independent variables. The purpose is to estimate the underlying relationship so that we can predict the target variable based on the other . Use account analysis to estimate the fixed costs per year, and the variable cost per unit. Software applications, such as Excel, can use regression analysis to estimate fixed and variable costs. The linear regression model’s slope coefficient is significant in econometrics . Calculate estimated production costs for January, February, and March. Also provide a total for the three months. Use the results of the regression analysis to estimate costs for 3,750 machine hours. For example, there may be a very high correlation between the number of salespeople employed by a company, the number of stores they operate, and the revenue the business generates. The relationship between dependent variable Y and independent variable X is linear in the slope and intercept ‘a’ and ‘b,’ respectively.

Assume you are a consultant performing work for two different companies. Each company has asked you to help them identify the behavior of certain costs. Contribution Margin Income Statement, Service Company.

## Accessible Colors For Data Visualization

This approach is likely to yield more accurate results than the high-low method when the high and low points are not representative of the entire set of data. Notice that fixed costs are much lower using the scattergraph method (\$5,000) than the high-low method (\$25,000). It is evident from this information that this company has very little in fixed costs and relatively high variable costs. This is indicative of a company that uses a high level of labor and materials and a low level of machinery .

• A box appears that requires the input of several items needed to perform regression.
• If our regression shows a value of 0.65, we can explain 65% of the dependent variable’s variability with the regression model.
• Zhang Corporation is trying to identify the cost behavior of the three costs shown.
• Python and R are both powerful coding languages that have become popular for all types of financial modeling, including regression.
• We need to standardize the covariance in order to allow us to better interpret and use it in forecasting, and the result is the correlation calculation.
• When the p-value is below the error margin (usually 0.05 for a 95% confidence interval, most common in finance), we deem the independent variable statistically significant.
• We have a dataset of 106 weekly observations of sales revenue amount and number of ad clicks from our marketing campaigns.

If one variable goes up and the other tends to go down, then the covariance would be negative. The company’s weekly sales appear to be quite volatile, but we can still see that our forecast somehow ‘fits’ with the rest of the chart. Looking at the equation, we have an intercept of €149,222, meaning on average, we should get about €150 thousand per week if we have zero ad clicks. The slope is at €1,325.2, which suggests that the company will generate about €1.3 thousand in sales revenue for each additional ad click.

## Building The Regression Model

Linear regression is basically a mathematical analysis method which considers the relationship between all the data points in a simulation. All these points are based upon two unknown variables – one independent and one dependent. The dependent variable will be plotted on the y-axis and the independent variable will be plotted to the x-axis on the graph of regression analysis. In literal manner, least square method of regression minimizes the sum of squares of errors that could be made based upon the relevant equation. Least squares regression method is a method to segregate fixed cost and variable cost components from a mixed cost figure. It is also known as linear regression analysis. The high-low method starts with the highest and lowest activity levels and uses four steps to estimate fixed and variable costs. Input X Range requires that you highlight the x-axis data, including the heading . Check the Labels box; this indicates that the top of each column has a heading . Select New Workbook; this will put the regression results in a new workbook. Lastly, check the Line Fit Plots box, then select OK.

Once the data analysis package is installed, historical data are entered in the spreadsheet, and the regression analysis is run. Measures the percent of the variance in the dependent variable explained by the independent variable. Go to the Data tab on the top menu bar and look for Data Analysis. If Data Analysis appears, you are ready to perform regression analysis. If Data Analysis does not appear, go to the help button (denoted as a question mark in the upper right-hand corner of the screen) and type Analysis ToolPak. Look for the Load the Analysis ToolPak option and follow the instructions given.

Linear regression attempts to estimate a line that best fits the data and the equation of that line results in the regression equation. When we use a small sample and put ‘enough’ predictor variables, we will almost certainly end up with a statistically significant model. This happens quite often, as we try to eliminate uncontrolled variables by adding them to our regression analysis. When the p-value is below the error margin (usually 0.05 for a 95% confidence interval, most common in finance), we deem the independent variable statistically significant. One of the measures we get from a regression analysis is the covariance. It calculates the relationship between two variables. This is why we introduce ɛ (residual/error) to the model — it covers the element of chance that an independent variable can experience variations.

The correlation calculation simply takes the covariance and divides it by the product of the standard deviation of the two variables. This will bind the correlation between a value of -1 and +1. R-squared suggests our model’s validity, and the p-value of each predictor shows if the relationship we noted in the sample also exists in the entire population. The coefficients are in line with what we see on the scatter plot — the two variables are highly positively correlated, meaning that when ad clicks increase, so does sales revenue. We have a dataset of 106 weekly observations of sales revenue amount and number of ad clicks from our marketing campaigns.

That’s where correlation, another measure of regression analysis, comes in. It helps us to standardize the covariance to be able to better understand and use it in forecasting. High-Low, Scattergraph, and Regression Analysis; Manufacturing Company. Woodworks, Inc., produces cabinet doors.

Once each of the independent variables has been determined, they can be used to predict the amount of effect that the independent variables have on the dependent variable. The effect is represented on a straight line to approximate each of the data points. Overall, simple linear regression analysis can be beneficial and is mostly easy to set up. This makes it a favored technique in the financial professional’s toolbox. Using the four equations listed in your answer to 1, calculate total production costs assuming Alta Production, Inc., will produce 400 units next month.

## Regression Analysis In Financial Modeling

These help us assess whether the relationships in our observations also exist in the broader population. The p-value for each predictor evaluates the null hypothesis that the variable shows no correlation with the dependent variable. Account analysis requires that a knowledgeable employee determine whether costs are fixed, variable, or mixed. If employees do not have enough experience to accurately estimate these costs, another method should be used. Thus total production costs are expected to be \$577,891 for next month.

Cost accountants will often throw out the high and low points for this reason and use the next highest and lowest points to perform this analysis. Regression analysis tends to yield the most accurate estimate of fixed and variable costs, assuming there are no unusual data points in the data set. It is important to review the data set first—perhaps in the form of a scattergraph—to confirm that no outliers exist. At the heart of a regression model is the relationship between two different variables, called the dependent and independent variables. For instance, suppose you want to forecast sales for your company and you’ve concluded that your company’s sales go up and down depending on changes in GDP.

A method of cost analysis that uses a set of data points to estimate fixed and variable costs. A method of cost analysis that uses the high and low activity data points to estimate fixed and variable costs. You are the accountant who assists Jim in preparing an estimate of production costs for the next three months. You intend to use regression analysis to estimate costs, as was done in the past. Jim expects that 3,100 machine hours will be used in January, 3,650 machine hours in February, and 2,850 machine hours in March.

## Simple Linear Regression Analysis

This step requires that each data point be plotted on a graph. The x-axis reflects the level of activity , and the y-axis reflects the total production cost. Figure 5.5 “Scattergraph of Total Mixed Production Costs for Bikes Unlimited” shows a scattergraph for Bikes Unlimited using the data points for 12 months, July through June. Estimate costs using account analysis, the high-low method, the scattergraph method, and regression analysis. We will begin by learning the core principles of regression, first learning about covariance and correlation, and then moving on to building and interpreting a regression output. Popular business software such as Microsoft Excel can do all the regression calculations and outputs for you, but it is still important to learn the underlying mechanics.

Contribution Margin Income Statement. Last month Kumar Production Company sold its product for \$60 per unit. Fixed production costs were \$40,000, and variable production costs amounted to \$15 per unit. Fixed selling and administrative costs totaled \$26,000, and variable selling and administrative costs amounted to \$5 per unit. Kumar Production produced and sold 7,000 units last month.

The monthly data in Table 5.4 “Monthly Production Costs for Bikes Unlimited” includes Total Production Costs and Units Produced. Thus use one column to enter Total Production Costs data and another column to enter Units Produced data.

The scattergraph method has five steps and starts with plotting all points on a graph and fitting a line through the points. This line represents costs throughout a range of activity levels and is used to estimate fixed and variable costs. The scattergraph is also used to identify any outlying or unusual data points. The y-intercept represents total fixed costs. This is where the line meets the y-axis. Total fixed costs in the graph appear to be approximately \$5,000.

## Accounting For Managers

Thus total production costs are expected to be \$580,756 for next month. Thus total production costs are expected to be \$625,000 for next month. The highest level of activity occurred in November (450 units; \$700,000 production costs), and the lowest level of activity occurred in October (150 units; \$250,000 production costs). Thus total production costs are expected to be \$578,428 for next month. Marie likes the regression analysis approach and asks you to estimate costs for 800 tax returns using this approach . Prepare a contribution margin income statement assuming 5,000 helmets will be produced, and each helmet will be sold for \$70.

Last month Seafood Grill had total sales of \$200,000. Food preparation and service costs totaled \$90,000 .

## Overfitting The Model

Explain how account analysis is used to estimate costs. Using the same spreadsheet set up in step 2, select Data, Data Analysis, and Regression. A box appears that requires the input of several items needed to perform regression. Input Y Range requires that you highlight the y-axis data, including the heading .

Use Excel to perform regression analysis after adding \$100,000 to production costs for each of the past 18 months, as Jim requested. Cordova Company would like to estimate production costs on an annual basis. Costs incurred for direct materials and direct labor are variable costs. The accounting records indicate that the following production costs were incurred last year for 50,000 units. A statistical tool that uses the least-squares method to estimate the fixed and variable components of mixed costs. There is a linear relationship between dependent and independent variables. Multiple regression is a statistical technique that predicts the value of one variable using the value of two or more independent variables.

Posted on Posted on: 10.10.2023Categories Business Accounting Blog