Login     Register

        Contact Us     Search

XLeratorDB/math Documentation

SQL Server correlation matrix function


CORRM

Updated: 05 March 2015


Use the table-valued function CORRM to calculate a correlation matrix. Given a matrix Am,n and a covariance matrix Cn,n = COVM (Am,n), then each element in correlation matrix Rn,n is calculated as:

For example:


Syntax
SELECT * FROM [wct].[CORRM](
  <@Matrix_RangeQuery, nvarchar(max),>
 ,<@Is3N, bit,>)
Arguments
@Matrix_RangeQuery
the SELECT statement, as text, used to return the input matrix for this function. The SELECT statement specifies the column names from the table or view or can be used to enter the matrix values directly. Data returned from the @Matrix_RangeQuery select must be of the type float or of a type that implicitly converts to float.
@Is3N
a bit value identifying the form for the resultant table returned by @Matrix_RangeQuery. Enter 'True' for a resultant table in 3rd normal form. Enter 'False' for a de-normalized table in 'spreadsheet' form.
Return Types
TABLE (
       [RowNum] [int] NULL,
       [ColNum] [int] NULL,
       [ItemValue] [float] NULL
Remarks
·         If @Is3N is NULL then @Is3N = 'False'.
·         Use MCORR for a matrix stored as a string.
·         If @Is3N is'True' then the result table should be returned as row, column, and value.
·         If the array contains NULL, then NULL will be returned.
·         The function returns an error if the array contains a non-numeric value.
·         If the supplied input is a vector, then result will be the (sample) variance.
Examples
In this example @Matrix_RangeQuery returns the matrix from a derived table embodied in the statement. The matrix is in spreadsheet form.
SELECT
   *
FROM
   wct.CORRM(
       'SELECT
          x1,x2,x3,x4,x5,x6,x7,x8
        FROM (VALUES
           (1,1,1,1,1,1,1,1)
          ,(2,3,4,5,6,7,8,9)
          ,(4,9,16,25,36,49,64,81)
        )n(x1,x2,x3,x4,x5,x6,x7,x8)'
        ,'False'
        )
This produces the following result.
 


Note that the results are returned in third-normal form. If we wanted to a more traditional (de-normalized) presentation of the results, we can us the PIVOT function. Also note that we can use * to select all the columns. We have rounded the results to 4 decimal place for ease of viewing.
SELECT
   ROUND([0],4) as [0],
   ROUND([1],4) as [1],
   ROUND([2],4) as [2],
   ROUND([3],4) as [3],
   ROUND([4],4) as [4],
   ROUND([5],4) as [5],
   ROUND([6],4) as [6],
   ROUND([7],4) as [7]
FROM (
SELECT
   *
FROM
   wct.CORRM(
       'SELECT
          *
        FROM (VALUES
           (1,1,1,1,1,1,1,1)
          ,(2,3,4,5,6,7,8,9)
          ,(4,9,16,25,36,49,64,81)
        )n(x1,x2,x3,x4,x5,x6,x7,x8)'
        ,'False'
        )
   ) d
PIVOT(SUM(ItemValue) for ColNum in([0],[1],[2],[3],[4],[5],[6],[7])) as P
This produces the following result.
 


Let's put several matrices into a table and calculate the correlation matrix for each. We will use CROSS APPLY to calculate to the correlation matrix for each matrix. Note that we have to convert the matrix identifier (which is defined as int) to a varchar to include it in the WHERE clause of @Matrix_RangeQuery.
CREATE TABLE #c(
   Matrix int,
   rn     int,
   x1     float,
   x2     float,
   x3     float,
   PRIMARY KEY (Matrix, rn)
   )
  
INSERT INTO #c VALUES (100,1,-11,-41,36)
INSERT INTO #c VALUES (100,2,-31,41,-47)
INSERT INTO #c VALUES (100,3,48,-38,33)
INSERT INTO #c VALUES (100,4,8,44,-10)
INSERT INTO #c VALUES (101,1,39,6,-7)
INSERT INTO #c VALUES (101,2,33,-49,16)
INSERT INTO #c VALUES (101,3,14,29,13)
INSERT INTO #c VALUES (101,4,35,-38,-50)
INSERT INTO #c VALUES (101,5,9,-32,-25)
INSERT INTO #c VALUES (102,1,29,49,-17)
INSERT INTO #c VALUES (102,2,35,28,28)
INSERT INTO #c VALUES (102,3,-34,-29,-49)
INSERT INTO #c VALUES (102,4,0,-5,0)
INSERT INTO #c VALUES (102,5,-17,14,24)
INSERT INTO #c VALUES (102,6,44,3,-23)
 
SELECT
   n.MATRIX
   ,k.*
FROM
   (SELECT DISTINCT MATRIX FROM #c)n
CROSS APPLY
   wct.CORRM(
       'SELECT
          x1,x2,x3
        FROM
          #c
       WHERE MATRIX = ' + cast(n.Matrix as varchar(max)) + ' ORDER by rn'
       ,'False'
       )k
This produces the following result.
 


In this example we calculate the correlation matrix from a derived table in 3rd normal form.
SELECT
   *
FROM
   wct.CORRM(
       'SELECT
          rownum,colnum,itemvalue
        FROM (VALUES
               (0,0,1),(0,1,1),(0,2,1),(0,3,1),(0,4,1),(0,5,1),(0,6,1),(0,7,1)
              ,(1,0,2),(1,1,3),(1,2,4),(1,3,5),(1,4,6),(1,5,7),(1,6,8),(1,7,9)
              ,(2,0,4),(2,1,9),(2,2,16),(2,3,25),(2,4,36),(2,5,49),(2,6,64),(2,7,81)
        )n(rownum,colnum,itemvalue)'
        ,'True'
        )
This produces the following result.

 

See Also

 



Copyright 2008-2024 Westclintech LLC         Privacy Policy        Terms of Service