{"id":677,"date":"2018-06-17T06:11:52","date_gmt":"2018-06-17T06:11:52","guid":{"rendered":"http:\/\/muthu.co\/?p=677"},"modified":"2021-05-24T03:37:52","modified_gmt":"2021-05-24T03:37:52","slug":"maths-behind-polynomial-regression","status":"publish","type":"post","link":"http:\/\/write.muthu.co\/maths-behind-polynomial-regression\/","title":{"rendered":"Maths behind Polynomial regression"},"content":{"rendered":"\n

Polynomial regression is the process of finding a polynomial function of the form $f(x) = c_0 + c_1 x + c_2 x^2 + \cdots + c_n x^n$, where $n$ is the degree of the polynomial and $c_0, \dots, c_n$ are its coefficients. Through polynomial regression we try to find an $n$th-degree polynomial that is the closest approximation of our data points. Below is a sample random dataset which has been regressed up to degree 3 and plotted on a graph. The blue dots represent our data set and the lines represent polynomial functions of different degrees. The higher the degree, the closer the approximation, but higher doesn't always mean better, which we will discuss in later articles.

```python
y_train = [[100], [110], [124], [142], [159], [161], [170], [173], [179], [180], [217], [228], [230], [284], [300], [330], [360], [392], [414], [435], [451], [476], [499], [515], [543], [564]]
```
\"\"<\/a><\/figure><\/div>\n\n\n\n

To understand the math behind the above analysis, let's start by constructing a line function that passes through two points: (1, 1) and (6, 9). The general form of the equation of a line is:

\"\"<\/a><\/figure><\/div>\n\n\n\n
\"\"<\/a><\/figure><\/div>\n\n\n\n

Our job here is to find the two unknowns m and b in the above line equation. Substituting our sample points into the line equation gives a system of two equations, which we can solve for m and b using the substitution method:

\"\"<\/a><\/figure><\/div>\n\n\n\n

Now substituting m and b in our line equation, we get the formula below:

\"\"<\/a><\/figure><\/div>\n\n\n\n

The above algebraic method works well for a small number of points and first-degree polynomials, but for higher-degree polynomials we will use matrices and linear algebra. Before we dive into the math of higher-degree functions, let's work out the same equation we derived in the previous example using matrices.

Our data points in a matrix will look like:

\"\"<\/a><\/figure><\/div>\n\n\n\n

Solving it using linear algebra:

\"\"<\/a><\/figure><\/div>\n\n\n\n

As you can see, we get the same equation of the line using matrices and linear algebra. The above method works great when we have two points, because we know a straight line passes exactly through them, but we cannot guarantee the same when we have more than two data points. In my article on linear regression I used the formula below to find the coefficients of the linear equation $y = Bx + A$.

\"\"<\/a><\/figure><\/div>\n\n\n\n

Working through how the above equation is derived will give us the general formula for a polynomial function of any degree.

In any regression analysis we try to find an equation of a line that is a close approximation of the actual training data points. We use a method called the "sum of least squares" to derive our equation. Basically, what we do is consider all possible line equations for a given set of data points and select the one with the least sum of squared errors, or residuals, where a residual (or error) is the difference between the predicted value and the actual value.

\"\"<\/a><\/figure><\/div>\n\n\n\n

Least squares is beautifully explained in the video below:

[Embedded video: least squares explained]