Home

XLeratorDB function packages for SQL Server
financial view documentation pricing
statistics view documentation pricing
math view documentation pricing
engineering view documentation pricing
strings view documentation pricing
financial-options view documentation pricing
windowing view documentation pricing

XLeratorDB Compilation packages for SQL Server
Suite incl: financial, statistics, math, engineering & strings pricing
Suite (Developer) requires SQL Server Developer Edition pricing
Suite (Subscription) One-year non-recurring license pricing

SuitePLUS incl: all Suite packages PLUS financial-options pricing
SuitePLUS (Developer) requires SQL Server Developer Edition, also incl: financial-options pricing
SuitePLUS (Subscription) One-year non-recurring license, also incl: financial-options pricing

XLeratorDLL function packages Microsoft .NET API Library
financial (DLL) view documentation pricing SQL Server not required

View All Product Pricing ...

Download Free 15 Day Trial ...

Documentation

Purchase

XLeratorDB function packages for SQL Server (2008 & later)
financial
statistics
math
engineering
strings
financial-options
windowing

XLeratorDB Compilation packages for SQL Server (2008 & later)
Suite
Suite (Developer)
Suite (Subscription)

SuitePLUS
SuitePLUS (Developer)
SuitePLUS (Subscription)

XLeratorDLL function packages Microsoft .NET API Library
financial (DLL)

Legacy XLeratorDB Packages for SQL Server 2005
financial for SQL Server 2005 only
statistics for SQL Server 2005 only
math for SQL Server 2005 only

Suite for SQL Server 2005 only
Suite (Developer) for SQL Server 2005 only
SuitePLUS for SQL Server 2005 only
SuitePLUS (Developer) for SQL Server 2005 only

Download Trial
Case Studies
Blog
Support

XLeratorDB/statistics Documentation

SQL Server chi-squared statistic for normalized data

CHISQN

Updated: 28 February 2011

Note: This documentation is for the SQL2008 (and later) version of this XLeratorDB function, it is not compatible with SQL Server 2005.
Click here for the SQL2005 version of the CHISQN function

Use the aggregate CHISQN to calculate the chi-square (χ²) statistic for normalized tables. This function calculates the chi-square statistic by finding the difference between each observed and theoretical frequency for each possible outcome, squaring them, dividing each by the theoretical frequency, and taking the sum of the results. A second important part of determining the test statistic is to define the degrees of freedom of the test: this is essentially the number of squares errors involving the observed frequencies adjusted for the effect of using some of those observations to define the expected frequencies.

CHISQN automatically calculates the expected results.

The value of the chi-square statistic is:

Where

r is the number of rows

c is the number of columns

O is the Observed result

E is the Expected result

Syntax

Arguments

@Expr1

the name or number of the rows.

@Expr2

the name or number of the columns.

@Obs

the observed value. @Obs is an expression of type float or of a type that can be implicitly converted to float.

Return Types

float

Remarks

· CHISQN is designed for normalized tables.

· CHISQN automatically calculates the expected values. If you want to enter the expected values use the CHISQN2 function.

· CHISQN is an aggregate function and follows the same conventions as all other aggregate functions in SQL Server.

· If you have previously used the CHISQN scalar function, the CHISQN aggregate has a different syntax. The CHISQN scalar function is no longer available in XLeratorDB/statistics2008, though you can still use the scalar CHISQN_q.

Examples

In this hypothetical situation, we want to determine if there is an association between population density and the preference for a sport from among baseball, football, and basketball. We will use the CHISQN function to calculate the chi-square statistic.

SELECT wct.CHISQN(

sport --@expr1

,locale --@expr2

,result --@obs

) as CHISQ

FROM (VALUES

('Basketball', 'Rural', 28),

('Basketball', 'Suburban', 35),

('Basketball', 'Urban', 54),

('Baseball', 'Rural', 60),

('Baseball', 'Suburban', 43),

('Baseball', 'Urban', 35),

('Football', 'Rural', 52),

('Football', 'Suburban', 48),

('Football', 'Urban', 28)

) n(sport, locale, result)

This produces the following result.

CHISQ

----------------------

22.451703426585

(1 row(s) affected)

What if the data had not been normalized? We could use that following SQL to take de-normalized data (the way it might be in a spreadsheet) and turn it into normalized data.

SELECT wct.CHISQN(n.sport, x.locale, x.result) as CHISQ

FROM (VALUES

('Basketball', 28, 35, 54),

('Baseball', 60, 43, 35),

('Football', 52, 48, 28)

) n(sport, rural, suburban, urban)

CROSS APPLY(VALUES

('rural', rural),

('suburban', suburban),

('urban',urban)

) x(locale, result)

This produces the following result.

CHISQ

----------------------

22.451703426585

(1 row(s) affected)

In this example, the de-normalized table contains another column, region. We will calculate the chi-square statistic for each region.

SELECT n.region,

wct.CHISQN(n.sport, x.locale, x.result) as CHISQ

FROM (VALUES

('Midwest','Baseball',27,38,36),

('Midwest','Basketball',46,33,31),

('Midwest','Football',43,29,36),

('Northeast','Baseball',39,46,42),

('Northeast','Basketball',42,34,36),

('Northeast','Football',35,30,41),

('Southeast','Baseball',31,36,33),

('Southeast','Basketball',32,39,34),

('Southeast','Football',31,24,40),

('Southwest','Baseball',44,19,34),

('Southwest','Basketball',37,42,24),

('Southwest','Football',41,29,36),

('West','Baseball',43,33,30),

('West','Basketball',44,38,34),

('West','Football',30,36,30)

) n(region, sport, urban, rural, suburban)

CROSS APPLY(VALUES

('rural', rural),

('suburban', suburban),

('urban',urban)

) x(locale, result)

GROUP BY region

This produces the following result.

region CHISQ

--------- ----------------------

Midwest 6.82237540453074

Northeast 2.88288327437734

Southeast 4.24269415109385

Southwest 11.8327601855115

West 2.05463121587102

(5 row(s) affected)

View Topic History

Products

Support

Contact Us
FAQ’s
Blog
XLeratorDB Documentation
- Financial
- Financial-Options
- Statistics
- Math
- Engineering
- Strings
- Windowing
XLeratorDLL Documentation
- Financial-DLL
XLeratorDB Installation Guide

XLeratorDB function packages
for SQL Server

XLeratorDB Compilation packages
for SQL Server

XLeratorDLL function packages
Microsoft .NET API Library

XLeratorDB function packages for
SQL Server (2008 & later)

XLeratorDB Compilation packages for
SQL Server (2008 & later)

XLeratorDLL function packages
Microsoft .NET API Library

Legacy XLeratorDB Packages for
SQL Server 2005

XLeratorDB/statistics Documentation

SQL Server chi-squared statistic for normalized data

Products

Support

About

Pricing

XLeratorDB function packagesfor SQL Server

XLeratorDB Compilation packagesfor SQL Server

XLeratorDLL function packagesMicrosoft .NET API Library

XLeratorDB function packages for SQL Server (2008 & later)

XLeratorDB Compilation packages for SQL Server (2008 & later)

XLeratorDLL function packagesMicrosoft .NET API Library

Legacy XLeratorDB Packages for SQL Server 2005

XLeratorDB/statistics Documentation

SQL Server chi-squared statistic for normalized data

Products

Support

About

Pricing

XLeratorDB function packages
for SQL Server

XLeratorDB Compilation packages
for SQL Server

XLeratorDLL function packages
Microsoft .NET API Library

XLeratorDB function packages for
SQL Server (2008 & later)

XLeratorDB Compilation packages for
SQL Server (2008 & later)

XLeratorDLL function packages
Microsoft .NET API Library

Legacy XLeratorDB Packages for
SQL Server 2005