XLeratorDB/statistics Documentation

SQL Server LINEST function

LINEST

Updated: 19 January 2017

Use the table-valued function LINEST to calculate the Ordinary Least Squares (OLS) solution for a series of x- and y-values. The OLS solution calculates a line that best fits the data supplied to the function. The LINEST function returns the statistics that describe the calculated solution, including the coefficients (m), the standard error of the coefficients (se), the t statistic for each coefficient (tstat) and the associated p-values (pval), the coefficient of determination (rsq), the adjusted r-square value (rsqa) and the multiple r-square value (rsqm), the standard error of the y estimate (sey), the F-observed value (F), the residual degrees of freedom (df), the regression sum of squares (ss_reg), and the residual sum of squares (ss_resid).

In the case where we have one column of x-values and a column of y-values, the OLS solution is immediately recognizable as the formula for a line:

y = mx + b

For the purpose of this function, we would re-write the equation as

y = m₀ + m₁x

The value for the y-intercept, then, is stored in m₀ and the value for the slope is stored in m₁.

For purposes of multi-linear regression, where there are multiple columns of x-values (and still a single column of y-values), the formula for the solution is described by the following equation:

y = m₀ + m₁x₁ + m₂x₂ + … + mₙxₙ

where n is the number of x-columns.
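
For the single x-column case, the OLS coefficients have a simple closed form. The following is a minimal sketch in Python, not the library's implementation; the function name simple_ols and the sample data are made up for illustration. Following the idx convention of the LINEST results, m0 is the y-intercept and m1 is the slope.

```python
def simple_ols(xs, ys):
    """Return (m0, m1) for the least-squares line y = m0 + m1*x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two paired observations")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x-values are all identical")
    m1 = sxy / sxx            # slope
    m0 = y_bar - m1 * x_bar   # y-intercept
    return m0, m1


# Illustrative data lying exactly on y = 1 + 2x.
m0, m1 = simple_ols([1, 2, 3, 4], [3, 5, 7, 9])
print(m0, m1)
```

With more than one x-column the same idea generalizes to solving a system of linear equations, which is the case LINEST is designed to handle.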

The function expects the input to be in row-column, or spreadsheet, format, rather than in third-normal form. Thus, the input into the function requires the specification of the column names for the x- and y-values, the specification of a 1-based index identifying the column number for the y-values, and a bit value which specifies whether or not the solution has a non-zero y-intercept. The column specifications and the table or view that contains the data are passed into the function as strings. The function dynamically creates SQL and the resultant table from the SQL is used as input into the OLS calculations.
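
To make the idea of passing table and column specifications as strings more concrete, here is a hedged sketch in Python of how such a statement could be assembled. The SQL that LINEST actually generates is not documented here, so the function build_input_query, the WHERE-clause form used for grouping, and the sample names mytable and grp are all assumptions for illustration.

```python
def build_input_query(table_name, column_names,
                      grouped_column_name="", grouped_column_value=None):
    """Assemble a SELECT statement from string arguments (hypothetical)."""
    sql = f"SELECT {column_names} FROM {table_name}"
    # Assumption: a blank group column means no filtering is applied.
    if grouped_column_name:
        # Naive quoting for illustration only; real code should use parameters.
        value = str(grouped_column_value).replace("'", "''")
        sql += f" WHERE {grouped_column_name} = '{value}'"
    return sql


print(build_input_query("mytable", "x1,x2,y"))
print(build_input_query("mytable", "y,x", "grp", "A"))
```

Because the column specification is spliced into the statement as text, anything that is valid in a SELECT list, such as * or a calculated expression, can be supplied in its place.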

LINEST automatically detects collinearity and removes the right-most collinear column, resulting in a regression coefficient of 0 for that column.

Syntax

SELECT * FROM [wct].[LINEST](

<@TableName, nvarchar(max),>

,<@ColumnNames, nvarchar(4000),>

,<@GroupedColumnName, nvarchar(4000),>

,<@GroupedColumnValue, sql_variant,>

,<@Y_ColumnNumber, int,>

,<@Lconst, bit,>)

Arguments

@TableName

the name, as text, of the table or view that contains the values used in the LINEST calculation.

@ColumnNames

the names, as text, of the columns in the table or view specified by @TableName that contain the values used in the LINEST calculation. Data returned from the @ColumnNames must be of the type float or of a type that implicitly converts to float.

@GroupedColumnName

the name, as text, of the column in the table or view specified by @TableName which will be used for grouping the results.

@GroupedColumnValue

the column value to do the grouping on.

@Y_ColumnNumber

the index into the array identifying the column containing the y-values. The index value must be between 1 and n, where n is the number of columns specified in @ColumnNames. @Y_ColumnNumber must be of the type int or of a type that implicitly converts to int.

@Lconst

a logical value specifying whether the y-intercept (m₀) is calculated. If @Lconst is 'True', the y-intercept is calculated normally; if @Lconst is 'False', the y-intercept is forced to equal zero. @Lconst must be of the type bit or of a type that implicitly converts to bit.

Return Type

RETURNS TABLE (

[stat_name] [nvarchar](10) NULL,

[idx] [int] NULL,

[stat_val] [float] NULL,

[col_name] [nvarchar](128) NULL

)

Table Description

stat_name

Identifies the statistic being returned:

m	the estimated coefficient
se	the standard error of the estimated coefficient
tstat	the t statistic
pval	the p-value (t distribution) for the t statistic
rsq	the coefficient of determination (r²)
rsqa	adjusted r square
rsqm	multiple r square
sey	the standard error for the y estimate
F	the F-observed value
df	the residual degrees of freedom
ss_reg	the regression sum of squares
ss_resid	the residual sum of squares

idx

Identifies the subscript for the estimated coefficient, the standard error of the estimated coefficient, the t statistic, and the p-value. For example, the stat_name m with an idx of 0, specifies that the stat_val is for m₀, or the y-intercept (which is b in y = mx + b). An idx of 1 for the same stat_name identifies m₁.

The stat_name se with an idx of 0 identifies the standard error of the m₀ coefficient (which is sometimes referred to as the standard error of b or se_b).

idx values are only supplied for the m, se, tstat, and pval stat_names. All others will have an idx of NULL.

stat_val

the calculated value of the statistic.

col_name

the column name from the resultant table produced by the dynamic SQL. col_name values are produced only for the m, se, tstat, and pval statistics; all other stat_names have NULL for col_name.

Remarks

· If @Lconst is NULL then @Lconst is set to 'True'.

· If @Lconst is true then the number of rows must be greater than the number of columns.

· If @Lconst is false then the number of rows must be greater than or equal to the number of columns.

· For more complicated queries, you can try the LINEST_q function.
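
The row-count rules above amount to requiring at least one residual degree of freedom. The sketch below (Python, for illustration only; residual_df is a hypothetical helper, not an XLeratorDB function) assumes that the column count includes the y column, as it does in @ColumnNames.

```python
def residual_df(n_rows, n_columns, lconst=True):
    """Residual degrees of freedom for n_rows observations.

    n_columns counts every column passed in, including the y column.
    """
    if lconst is None:
        lconst = True  # mirrors the NULL default described in the remarks
    n_x = n_columns - 1                          # x-columns only
    n_coefficients = n_x + (1 if lconst else 0)  # plus the intercept, if any
    df = n_rows - n_coefficients
    if df < 1:
        raise ValueError("not enough rows for the number of columns")
    return df


print(residual_df(11, 6, True))    # 11 rows, 5 x-columns, intercept
print(residual_df(11, 6, False))   # same data, intercept forced to zero
```

For 11 rows and 6 columns this gives 5 residual degrees of freedom with an intercept and 6 without one.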

Examples

Example #1

We put the x- and y-data into a temp table, #xy.

SELECT *

INTO

#xy

FROM (VALUES

(6.1,1.21,4.35,5.42,6.45,138.08)

,(7.95,0.97,2.79,4.73,8.14,80.3)

,(8.53,9.73,9.7,1.16,9.05,284.45)

,(7.4,1.61,9.9,8.8,4.38,226.66)

,(7.42,4.58,0.06,8.97,8.75,112.37)

,(6.19,3.56,9.69,8.7,5.67,168.73)

,(8.44,3.85,8.23,1.05,0.92,160.38)

,(0.84,3.86,7.85,2.14,3.03,129.26)

,(0.37,7.33,5.07,8.06,3.25,170.39)

,(7.48,0.68,8.34,2.98,2.81,188.53)

,(5.33,3.51,7.03,6.49,7.54,131.1)

)n(x1,x2,x3,x4,x5,y)

This is what the data look like.

To invoke the table-valued function LINEST, we enter the following SQL.

SELECT *

FROM

wct.LINEST(

'#xy', --@TableName

'x1,x2,x3,x4,x5,y', --@ColumnNames

'', --@GroupedColumnName

NULL, --@GroupedColumnValue

6, --@Y_ColumnNumber

'True' --@Lconst

)

Note that the table name and the column names are both enclosed in single quotes, so that they are passed into the function as strings. Essentially, the function dynamically creates a SQL statement to SELECT the column names from the #xy table. This means that we can simplify the second parameter, since we are selecting all the columns in this particular table, by entering the following:

SELECT *

FROM

wct.LINEST(

'#xy', --@TableName

'*', --@ColumnNames

'', --@GroupedColumnName

NULL, --@GroupedColumnValue

6, --@Y_ColumnNumber

'True' --@Lconst

)

The third parameter is entered as blank (2 single quotes), since there is no column on which we want to group the results for input into the calculation, and the fourth parameter is NULL, since there is no grouped column name. The fifth parameter, the y-column number, specifies that the 6th column in the resultant table, which becomes the input into the calculation, contains the y-values. The last parameter specifies that we want to calculate the y-intercept.

The following results are produced.

stat_name          idx               stat_val col_name
---------- ----------- ---------------------- --------------
m                    0      -2.00793721243439 Intercept
m                    1       6.75602376546862 x1
m                    2       10.7172546474712 x2
m                    3        11.482088782086 x3
m                    4       2.66947583235558 x4
m                    5      -1.11015571062543 x5
se                   0       56.6362962641145 Intercept
se                   1       5.18738179897952 x1
se                   2        5.0471020356278 x2
se                   3       4.44876625513428 x3
se                   4       4.35402063583205 x4
se                   5       5.97904930603055 x5
tstat                0    -0.0354531871764828 Intercept
tstat                1       1.30239570312671 x1
tstat                2       2.12344719243191 x2
tstat                3       2.58096023112803 x3
tstat                4      0.613105921085155 x4
tstat                5     -0.185674285961434 x5
pval                 0      0.973090230335637 Intercept
pval                 1      0.249542367556677 x1
pval                 2      0.087124225219611 x2
pval                 3     0.0493746060236316 x3
pval                 4      0.566619010443336 x4
pval                 5      0.859997582994016 x5
rsq               NULL      0.777597428923374 NULL
sey               NULL       37.5668536635793 NULL
F                 NULL       3.49635089720011 NULL
df                NULL                      5 NULL
ss_reg            NULL       24671.4493290961 NULL
ss_resid          NULL        7056.3424709039 NULL
rsqm              NULL      0.881814849570687 NULL
rsqa              NULL      0.555194857846747 NULL
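
The summary rows in this output are tied together by standard regression identities. As a hedged sketch (in Python, not part of XLeratorDB; the function name summary_stats is made up for illustration), the following derives rsq, rsqm, rsqa, sey and F from ss_reg, ss_resid, df and the 11 observations in #xy, assuming a model with an intercept.

```python
import math


def summary_stats(ss_reg, ss_resid, df, n_obs):
    """Derive rsq, rsqm, rsqa, sey and F for a fit with an intercept."""
    ss_total = ss_reg + ss_resid
    reg_df = n_obs - df - 1              # regression degrees of freedom
    rsq = ss_reg / ss_total
    return {
        "rsq": rsq,
        "rsqm": math.sqrt(rsq),
        "rsqa": 1 - (1 - rsq) * (n_obs - 1) / df,
        "sey": math.sqrt(ss_resid / df),
        "F": (ss_reg / reg_df) / (ss_resid / df),
    }


# ss_reg, ss_resid and df are taken from the output above; #xy has 11 rows.
stats = summary_stats(24671.4493290961, 7056.3424709039, 5, 11)
for name, value in stats.items():
    print(name, round(value, 6))
```

Rounded to six places, the printed values should agree with the corresponding rows above, which is a quick way to sanity-check a result set.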

The results are returned in third-normal form. You can use standard SQL commands to re-format the results. For example, if you wanted to produce the coefficients in a format similar to the output of the Excel Data Analysis Regression tool, you could use the following SQL.

;WITH mycte as (

SELECT *

FROM wct.LINEST('#xy','*','',NULL,6,1)p

)

SELECT

d.col_name

,d.m

,d.se

,d.tstat

,d.pval

,m-wct.T_INV_2T(.05,m.stat_val)*se as [Lower Confidence Level]

,m+wct.T_INV_2T(.05,m.stat_val)*se as [Upper Confidence Level]

FROM mycte p

PIVOT(MAX(stat_val) FOR stat_name in(m,se,tstat,pval))d

CROSS JOIN mycte m

WHERE stat_name = 'df'

AND d.col_name IS NOT NULL

This produces the following result.

col_name                           m                     se                  tstat                   pval Lower Confidence Level Upper Confidence Level
------------- ---------------------- ---------------------- ---------------------- ---------------------- ---------------------- ----------------------
Intercept          -2.00793721243439       56.6362962641145    -0.0354531871764828      0.973090230335637      -147.596171626684       143.580297201815
x1                  6.75602376546862       5.18738179897952       1.30239570312671      0.249542367556677      -6.57856566149857       20.0906131924358
x2                  10.7172546474712        5.0471020356278       2.12344719243191      0.087124225219611      -2.25673416791668       23.6912434628591
x3                   11.482088782086       4.44876625513428       2.58096023112803     0.0493746060236316     0.0461710556459689       22.9180065085259
x4                  2.66947583235558       4.35402063583205      0.613105921085155      0.566619010443336      -8.52289052609997       13.8618421908111
x5                 -1.11015571062543       5.97904930603055     -0.185674285961434      0.859997582994016      -16.4797912510815       14.2594798298306
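
Each confidence bound in this result is the coefficient plus or minus a critical t value times its standard error. A minimal Python sketch of that arithmetic follows. It is illustrative only: confidence_interval is a made-up name, and the constant 2.570581836 is assumed to be the value that wct.T_INV_2T(.05, 5) returns (it is the standard two-tailed 5% critical value of Student's t for 5 degrees of freedom).

```python
T_CRIT_95_DF5 = 2.570581836  # assumed two-tailed 5% critical t value, 5 df


def confidence_interval(m, se, t_crit):
    """Return (lower, upper) bounds as m -/+ t_crit * se."""
    half_width = t_crit * se
    return m - half_width, m + half_width


# Coefficient and standard error for x3, taken from the result above.
lower, upper = confidence_interval(11.482088782086, 4.44876625513428,
                                   T_CRIT_95_DF5)
print(lower, upper)
```

The lower bound for x3 comes out just above zero, which is consistent with its p-value sitting just under 0.05.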

Similarly, if you wanted to reformat the results to produce the equivalent of the ANOVA table from the Excel Data Analysis Regression tool, you could use the following SQL.

;WITH mycte as (

SELECT

df as [Residual df]

,Obs - df - 1 as [Regression df]

,ss_reg as [Regression SS]

,ss_resid as [Residual SS]

FROM (

SELECT

stat_name

,stat_Val

FROM

wct.LINEST('#xy','*','',NULL,6,1)

WHERE

stat_name in('ss_reg','ss_resid','F','df')

UNION

SELECT

'Obs'

,COUNT(*)

FROM

#xy)d

PIVOT(MAX(stat_val) FOR stat_name in(df,F,ss_reg,ss_resid,Obs))p

)

SELECT

'Regression'

,[Regression df] as DF

,[Regression SS] as SS

,[Regression SS]/[Regression df] as MS

,wct.F_DIST_RT(F,[Regression df],[Residual df]) as [Significance F]

FROM

mycte

UNION ALL

SELECT

'Residual'

,[Residual df]

,[Residual SS]

,[Residual SS]/[Residual df] as [Residual MS]

,NULL

FROM

mycte

UNION ALL

SELECT

'Total'

,[Regression df] + [Residual df]

,[Regression SS] + [Residual SS]

,NULL --MS

,NULL --Significance F

FROM

mycte

This produces the following result.

Example #2

Using the same data as Example #1, if we wanted to calculate the coefficients with a y-intercept of 0 we would modify the function call to make the last parameter FALSE. This time, however, we will put the results into a temporary table, #L0, and reformat the results using the #L0 table.

SELECT *

INTO

#L0

FROM

wct.LINEST(

'#xy', --@TableName

'*', --@ColumnNames

'', --@GroupedColumnName

NULL, --@GroupedColumnValue

6, --@Y_ColumnNumber

'False' --@Lconst

)

The #L0 table should contain the following data.

stat_name  idx         stat_val               col_name
---------- ----------- ---------------------- --------------
m          0           0                      Intercept
m          1           6.68653600898501       x1
m          2           10.669110550903        x2
m          3           11.3894353187733       x3
m          4           2.58577750738032       x4
m          5           -1.15990773149072      x5
se         0           0                      Intercept
se         1           4.38493471629212       x1
se         2           4.4380097835218        x2
se         3           3.28695713905425       x3
se         4           3.34008793898134       x4
se         5           5.30630339494569       x5
tstat      0           NULL                   Intercept
tstat      1           1.52488838297668       x1
tstat      2           2.40403042609709       x2
tstat      3           3.46503919489816       x3
tstat      4           0.774164499443965      x4
tstat      5           -0.218590541316492     x5
pval       0           NULL                   Intercept
pval       1           0.178129170463663      x1
pval       2           0.0530030468411133     x2
pval       3           0.0133846278949679     x3
pval       4           0.468231464499349      x4
pval       5           0.834214519463804      x5
rsq        NULL        0.978154399885257      NULL
sey        NULL        34.2979988105928       NULL
F          NULL        53.7309697924091       NULL
df         NULL        6                      NULL
ss_reg     NULL        316032.862965531       NULL
ss_resid   NULL        7058.11633446853       NULL
rsqm       NULL        0.989016885541019      NULL
rsqa       NULL        0.959949733122971      NULL

We can use the same technique as in Example #1 to reformat the coefficient statistics.

SELECT

d.col_name

,d.m

,d.se

,d.tstat

,d.pval

,m-wct.T_INV_2T(.05,m.stat_val)*se as [Lower Confidence Level]

,m+wct.T_INV_2T(.05,m.stat_val)*se as [Upper Confidence Level]

FROM

#L0 p

PIVOT(MAX(stat_val) FOR stat_name in(m,se,tstat,pval))d

CROSS JOIN #L0 m

WHERE

m.stat_name = 'df'

AND d.col_name IS NOT NULL

This produces the following result.

col_name       m                      se                     tstat                  pval                   Lower Confidence Level Upper Confidence Level
-------------- ---------------------- ---------------------- ---------------------- ---------------------- ---------------------- ----------------------
Intercept      0                      0                      NULL                   NULL                   0                      0
x1             6.68653600898501       4.38493471629212       1.52488838297668       0.178129170463663      -4.04301271480719      17.4160847327772
x2             10.669110550903        4.4380097835218        2.40403042609709       0.0530030468411133     -0.190308183893837     21.5285292856998
x3             11.3894353187733       3.28695713905425       3.46503919489816       0.0133846278949679     3.3465409410159        19.4323296965307
x4             2.58577750738032       3.34008793898134       0.774164499443965      0.468231464499349      -5.58712325437951      10.7586782691401
x5             -1.15990773149072      5.30630339494569       -0.218590541316492     0.834214519463804      -14.1439643943541      11.8241489313727

Note that even though we specified a 0 intercept, an intercept row is still created with a regression coefficient of 0 (keeping the results consistent with the Excel Data Analysis Regression tool). It is simple enough to exclude this row from the output (as R does) by adding another condition to the WHERE clause.

WHERE

m.stat_name = 'df'

AND d.col_name IS NOT NULL

AND d.col_name <> 'Intercept'

The following SQL can be used to reproduce the ANOVA table.

;WITH mycte as (

SELECT

df as [Residual df]

--,Obs - df - 1 as [Regression df]

,Obs - df as [Regression df]

,ss_reg as [Regression SS]

,ss_resid as [Residual SS]

FROM (

SELECT

stat_name

,stat_Val

FROM

#L0

WHERE

stat_name in('ss_reg','ss_resid','F','df')

UNION

SELECT

'Obs'

,COUNT(*)

FROM

#xy)d

PIVOT(MAX(stat_val) FOR stat_name in(df,F,ss_reg,ss_resid,Obs))p

)

SELECT

'Regression'

,[Regression df] as DF

,[Regression SS] as SS

,[Regression SS]/[Regression df] as MS

,wct.F_DIST_RT(F,[Regression df],[Residual df]) as [Significance F]

FROM

mycte

UNION ALL

SELECT

'Residual'

,[Residual df]

,[Residual SS]

,[Residual SS]/[Residual df] as [Residual MS]

,NULL

FROM

mycte

UNION ALL

SELECT

'Total'

,[Regression df] + [Residual df]

,[Regression SS] + [Residual SS]

,NULL --MS

,NULL --Significance F

FROM

mycte

This produces the following result.

Note that the calculation of the regression degrees of freedom needs to be adjusted to reflect the 0 intercept. It is also worth noting that the calculation in this SQL produces a Significance F that agrees with R, while Excel produces a different value (which seems to be caused by subtracting 1 from the residual degrees of freedom).
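
The degrees-of-freedom adjustment can be checked with a few lines of arithmetic. The Python sketch below is illustrative only (anova_rows is a hypothetical helper); it uses the ss_reg, ss_resid and df values from the #L0 table and the 11 rows in #xy, and drops the extra degree of freedom only when an intercept is fitted.

```python
def anova_rows(ss_reg, ss_resid, residual_df, n_obs, intercept=True):
    """Return regression df, mean squares and F for the ANOVA table."""
    # With an intercept, one extra degree of freedom is used by the constant.
    regression_df = n_obs - residual_df - (1 if intercept else 0)
    ms_reg = ss_reg / regression_df
    ms_resid = ss_resid / residual_df
    return {
        "regression_df": regression_df,
        "ms_reg": ms_reg,
        "ms_resid": ms_resid,
        "F": ms_reg / ms_resid,
    }


# Example #2 values: zero intercept, 11 observations, 6 residual df.
row = anova_rows(316032.862965531, 7058.11633446853, 6, 11, intercept=False)
print(row["regression_df"], row["F"])
```

With intercept=False the regression degrees of freedom come out as 5, and the resulting F should match the F row stored in #L0.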

Example #3

Here’s another example. Instead of just having a matrix of x- and y-values in our table, we are going to add a column, testid, which is a way of grouping x- and y-values together for purposes of doing the LINEST calculation. This allows us to compute the ordinary least squares values for multiple sets of data in a single SELECT statement.

We also take the x-values and raise them to the powers of 2, 3, 4, and 5, thus solving for the equation:

y = B₀ + B₁x + B₂x² + B₃x³ + B₄x⁴ + B₅x⁵
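
As a small illustration of this polynomial setup (a Python sketch with made-up helper names, not part of XLeratorDB), each x-value is expanded into five x-columns, and the fitted equation is evaluated as a sum of powers. With every coefficient set to 1, the polynomial reproduces the Wampler1 y-values in the data that follows, for example 63 at x = 2 and 364 at x = 3.

```python
def power_columns(x, degree=5):
    """Return [x, x**2, ..., x**degree], one value per x-column."""
    return [x ** k for k in range(1, degree + 1)]


def polynomial(x, coefficients):
    """Evaluate B0 + B1*x + ... + Bn*x**n for coefficients [B0, ..., Bn]."""
    return sum(b * x ** k for k, b in enumerate(coefficients))


print(power_columns(2))          # [2, 4, 8, 16, 32]
print(polynomial(2, [1] * 6))    # 63
print(polynomial(3, [1] * 6))    # 364
```

This is why a regression on the Wampler1 rows would be expected to return coefficients very close to 1.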

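--drop the #xy table from Example #1 if it still exists in this session

IF OBJECT_ID('tempdb..#xy') IS NOT NULL DROP TABLE #xy

SELECT *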

INTO

#xy

FROM (VALUES

('Wampler5',7590001,0)

,('Wampler5',-20479994,1)

,('Wampler5',20480063,2)

,('Wampler5',-20479636,3)

,('Wampler5',25231365,4)

,('Wampler5',-20476094,5)

,('Wampler5',20489331,6)

,('Wampler5',-20460392,7)

,('Wampler5',18417449,8)

,('Wampler5',-20413570,9)

,('Wampler5',20591111,10)

,('Wampler5',-20302844,11)

,('Wampler5',18651453,12)

,('Wampler5',-20077766,13)

,('Wampler5',21059195,14)

,('Wampler5',-19666384,15)

,('Wampler5',26348481,16)

,('Wampler5',-18971402,17)

,('Wampler5',22480719,18)

,('Wampler5',-17866340,19)

,('Wampler5',10958421,20)

,('Wampler4',75901,0)

,('Wampler4',-204794,1)

,('Wampler4',204863,2)

,('Wampler4',-204436,3)

,('Wampler4',253665,4)

,('Wampler4',-200894,5)

,('Wampler4',214131,6)

,('Wampler4',-185192,7)

,('Wampler4',221249,8)

,('Wampler4',-138370,9)

,('Wampler4',315911,10)

,('Wampler4',-27644,11)

,('Wampler4',455253,12)

,('Wampler4',197434,13)

,('Wampler4',783995,14)

,('Wampler4',608816,15)

,('Wampler4',1370781,16)

,('Wampler4',1303798,17)

,('Wampler4',2205519,18)

,('Wampler4',2408860,19)

,('Wampler4',3444321,20)

,('Wampler3',760,0)

,('Wampler3',-2042,1)

,('Wampler3',2111,2)

,('Wampler3',-1684,3)

,('Wampler3',3888,4)

,('Wampler3',1858,5)

,('Wampler3',11379,6)

,('Wampler3',17560,7)

,('Wampler3',39287,8)

,('Wampler3',64382,9)

,('Wampler3',113159,10)

,('Wampler3',175108,11)

,('Wampler3',273291,12)

,('Wampler3',400186,13)

,('Wampler3',581243,14)

,('Wampler3',811568,15)

,('Wampler3',1121004,16)

,('Wampler3',1506550,17)

,('Wampler3',2002767,18)

,('Wampler3',2611612,19)

,('Wampler3',3369180,20)

,('Wampler2',1,0)

,('Wampler2',1.11111,1)

,('Wampler2',1.24992,2)

,('Wampler2',1.42753,3)

,('Wampler2',1.65984,4)

,('Wampler2',1.96875,5)

,('Wampler2',2.38336,6)

,('Wampler2',2.94117,7)

,('Wampler2',3.68928,8)

,('Wampler2',4.68559,9)

,('Wampler2',6,10)

,('Wampler2',7.71561,11)

,('Wampler2',9.92992,12)

,('Wampler2',12.75603,13)

,('Wampler2',16.32384,14)

,('Wampler2',20.78125,15)

,('Wampler2',26.29536,16)

,('Wampler2',33.05367,17)

,('Wampler2',41.26528,18)

,('Wampler2',51.16209,19)

,('Wampler2',63,20)

,('Wampler1',1,0)

,('Wampler1',6,1)

,('Wampler1',63,2)

,('Wampler1',364,3)

,('Wampler1',1365,4)

,('Wampler1',3906,5)

,('Wampler1',9331,6)

,('Wampler1',19608,7)

,('Wampler1',37449,8)

,('Wampler1',66430,9)

,('Wampler1',111111,10)

,('Wampler1',177156,11)

,('Wampler1',271453,12)

,('Wampler1',402234,13)

,('Wampler1',579195,14)

,('Wampler1',813616,15)

,('Wampler1',1118481,16)

,('Wampler1',1508598,17)

,('Wampler1',2000719,18)

,('Wampler1',2613660,19)

,('Wampler1',3368421,20)

)n(testid,y,x)

Let’s say we wanted to run LINEST for all the data where the testid is equal to Wampler3. We could simply enter the following statement.

SELECT *

FROM

wct.LINEST(

'#xy' --@TableName

,'y,x,POWER(x,2),POWER(x,3),POWER(x,4),POWER(x,5)' --@ColumnNames

,'testid' --@GroupedColumnName

,'Wampler3' --@GroupedColumnValue

,1 --@Y_ColumnNumber

,'True' --@Lconst

)

This produces the following result.

stat_name  idx         stat_val               col_name
---------- ----------- ---------------------- ---------------
m          0           1.00000000017179       Intercept
m          1           0.999999999679294      x
m          2           1.0000000000945        Column1
m          3           0.999999999989123      Column2
m          4           1.00000000000055       Column3
m          5           0.99999999999999       Column4
se         0           2152.32624678169       Intercept
se         1           2363.55173469678       x
se         2           779.343524331576       Column1
se         3           101.475507550349       Column2
se         4           5.64566512170747       Column3
se         5           0.112324854679311      Column4
tstat      0           0.000464613578757896   Intercept
tstat      1           0.000423092071563048   x
tstat      2           0.00128313121091521    Column1
tstat      3           0.00985459471087597    Column2
tstat      4           0.177127048530663      Column3
tstat      5           8.90274910975851       Column4
pval       0           0.999635414800478      Intercept
pval       1           0.999667996985275      x
pval       2           0.998993119116665      Column1
pval       3           0.992267170697082      Column2
pval       4           0.86177817377114       Column3
pval       5           2.25341849383758E-07   Column4
rsq        NULL        0.99999555902582       NULL
sey        NULL        2360.14502379269       NULL
F          NULL        675524.458240117       NULL
df         NULL        15                     NULL
ss_reg     NULL        18814317208116.7       NULL
ss_resid   NULL        83554268.0000007       NULL
rsqm       NULL        0.999997779510445      NULL
rsqa       NULL        0.999994078701093      NULL

You will notice that the column names for the calculated columns are Column1, Column2, Column3, and Column4. This is because we have not assigned them a name and these are the default column names. Even though Column1 does not refer to the first column in the resultant table from our dynamic SQL, it is the first column with no name. To get more descriptive column names, you can simply assign the names in the @ColumnNames variable as in this example.

SELECT *

FROM

wct.LINEST(

'#xy' --@TableName

,'y,x,POWER(x,2) as [x^2],POWER(x,3) as [x^3],POWER(x,4) as [x^4],POWER(x,5) as [x^5]' --@ColumnNames

,'testid' --@GroupedColumnName

,'Wampler3' --@GroupedColumnValue

,1 --@Y_ColumnNumber

,'True' --@Lconst

)

This produces the following result.

stat_name  idx         stat_val               col_name
---------- ----------- ---------------------- ----------------
m          0           1.00000000017179       Intercept
m          1           0.999999999679294      x
m          2           1.0000000000945        x^2
m          3           0.999999999989123      x^3
m          4           1.00000000000055       x^4
m          5           0.99999999999999       x^5
se         0           2152.32624678169       Intercept
se         1           2363.55173469678       x
se         2           779.343524331576       x^2
se         3           101.475507550349       x^3
se         4           5.64566512170747       x^4
se         5           0.112324854679311      x^5
tstat      0           0.000464613578757896   Intercept
tstat      1           0.000423092071563048   x
tstat      2           0.00128313121091521    x^2
tstat      3           0.00985459471087597    x^3
tstat      4           0.177127048530663      x^4
tstat      5           8.90274910975851       x^5
pval       0           0.999635414800478      Intercept
pval       1           0.999667996985275      x
pval       2           0.998993119116665      x^2
pval       3           0.992267170697082      x^3
pval       4           0.86177817377114       x^4
pval       5           2.25341849383758E-07   x^5
rsq        NULL        0.99999555902582       NULL
sey        NULL        2360.14502379269       NULL
F          NULL        675524.458240117       NULL
df         NULL        15                     NULL
ss_reg     NULL        18814317208116.7       NULL
ss_resid   NULL        83554268.0000007       NULL
rsqm       NULL        0.999997779510445      NULL
rsqa       NULL        0.999994078701093      NULL

In the following SQL we return selected values for each test.

SELECT

p.testid

,p.rsq

,p.rsqa

,p.rsqm

,p.F

,p.df

,p.ss_reg

,p.ss_resid

FROM (

SELECT n.testid,k.stat_name,k.stat_val

FROM (SELECT DISTINCT testid FROM #xy)n

CROSS APPLY wct.LINEST('#xy','y,x,POWER(x,2) as [x^2],POWER(x,3) as [x^3],POWER(x,4) as [x^4],POWER(x,5) as [x^5]','testid',n.testid,1,'True') k

)d

PIVOT(SUM(stat_val) FOR stat_name IN(F,df,ss_reg,ss_resid,rsq,rsqm,rsqa))p

This produces the following result.

testid   rsq                    rsqa                   rsqm                   F                      df                     ss_reg                 ss_resid
-------- ---------------------- ---------------------- ---------------------- ---------------------- ---------------------- ---------------------- ----------------------
Wampler1 1                      1                      1                      1.38198630394956E+32   15                     18814317208116.7       4.08419037605817E-19
Wampler2 1                      1                      1                      6.62975864402057E+31   15                     6602.91858365167       2.98785473417206E-28
Wampler3 0.99999555902582       0.999994078701093      0.999997779510445      675524.458240117       15                     18814317208116.7       83554268.0000007
Wampler4 0.957478440825662      0.94330458776755       0.978508273253559      67.5524458240123       15                     18814317208116.7       835542680000
Wampler5 0.0022466892157494     -0.330337747712334     0.0473992533248088     0.00675524458240124    15                     18814317208116.7       8.3554268E+15

In this SQL we select the coefficient statistics for each of the tests.

SELECT

p.testid

,p.col_name

,p.m

,p.se

,p.tstat

,p.pval

FROM (

SELECT

n.testid

,k.stat_name

,k.idx

,k.col_name

,k.stat_val

FROM (

SELECT DISTINCT testid

FROM #xy)n

CROSS APPLY wct.LINEST('#xy','y,x,POWER(x,2) as [x^2],POWER(x,3) as [x^3],POWER(x,4) as [x^4],POWER(x,5) as [x^5]','testid',n.testid,1,'True') k

WHERE k.idx IS NOT NULL

)d

PIVOT(SUM(stat_val) FOR stat_name IN(m,se,tstat,pval))p

ORDER BY

testid

,idx

This produces the following result.

testid    col_name            m                      se                     tstat                  pval
--------- ------------------- ---------------------- ---------------------- ---------------------- ----------------------
Wampler1  Intercept           0.999999999993022      1.50479303077928E-10   6645432159.36582       6.15882032963405E-140
Wampler1  x                   0.999999999844435      1.65247075510795E-10   6051544311.77277       2.50814535309203E-139
Wampler1  x^2                 1.00000000004225       5.44875901481339E-11   18352802855.1745       1.4852856151972E-146
Wampler1  x^3                 0.999999999995625      7.09463246033833E-12   140951628655.325       7.78564309561968E-160
Wampler1  x^4                 1.00000000000021       3.94715138653456E-13   2533472628923.33       1.17924139944035E-178
Wampler1  x^5                 0.999999999999996      7.85316160862982E-15   127337249611812        3.57284346398899E-204
Wampler2  Intercept           0.999999999999995      4.0700853992358E-15    245695090375194        1.86809429176221E-208
Wampler2  x                   0.0999999999999979     4.46951637564801E-15   22373785348420.3       7.60771017143417E-193
Wampler2  x^2                 0.0100000000000011     1.47375180882262E-15   6785403037428.73       4.5051705244268E-185
Wampler2  x^3                 0.000999999999999853   1.91891904063468E-16   5211267275085.79       2.36154241344595E-183
Wampler2  x^4                 0.000100000000000008   1.06760483988871E-17   9366761582912.28       3.57687675436682E-187
Wampler2  x^5                 9.99999999999986E-06   2.12408203303352E-19   47079160995106.7       1.0837154071571E-197
Wampler3  Intercept           1.00000000017179       2152.32624678169       0.000464613578757896   0.999635414800478
Wampler3  x                   0.999999999679294      2363.55173469678       0.000423092071563048   0.999667996985275
Wampler3  x^2                 1.0000000000945        779.343524331576       0.00128313121091521    0.998993119116665
Wampler3  x^3                 0.999999999989123      101.475507550349       0.00985459471087597    0.992267170697082
Wampler3  x^4                 1.00000000000055       5.64566512170747       0.177127048530663      0.86177817377114
Wampler3  x^5                 0.99999999999999       0.112324854679311      8.90274910975851       2.25341849383758E-07
Wampler4  Intercept           1.00000001085174       215232.624678168       4.64613583719948E-06   0.999996354191052
Wampler4  x                   0.999999977160948      236355.173469677       4.23092062035716E-06   0.999996679970601
Wampler4  x^2                 1.00000000841373       77934.3524331573       1.28313122158987E-05   0.999989931165944
Wampler4  x^3                 0.999999998874462      10147.5507550348       9.85459469989148E-05   0.999922670373091
Wampler4  x^4                 1.00000000006258       564.566512170745       0.0017712704854165     0.99861007362054
Wampler4  x^5                 0.99999999999877       11.2324854679311       0.0890274910974769     0.930237861013585
Wampler5  Intercept           1.00000109952421       21523262.4678168       4.64614089531958E-08   0.999999967977304
Wampler5  x                   0.999997728516215      23635517.3469677       4.23091110651957E-08   0.999999967977304
Wampler5  x^2                 1.00000083489823       7793435.24331573       1.28313228207793E-07   0.999999898735342
Wampler5  x^3                 0.999999888409967      1014755.07550348       9.85459361130866E-07   0.999999226799412
Wampler5  x^4                 1.00000000620071       56456.6512170745       1.7712704962888E-05    0.999986100711028
Wampler5  x^5                 0.999999999878203      1123.2485467931        0.000890274910867431   0.999301395744685
