原文:SQL server 2005 PIVOT运算符的使用
PIVOT,UNPIVOT运算符是SQL server 2005支持的新功能之一,主要用来实现行到列的转换。本文主要介绍PIVOT运算符的操作,以及如何实现动态PIVOT的行列转换。
关于UNPIVOT及SQL server 2000下的行列转换请参照本人的其它文章。
一、PIVOT的语法
SELECT
[non-pivoted column], -- optional
[additional non-pivoted columns], -- optional
[first pivoted column],
[additional pivoted columns]
FROM (
SELECT query producing sql data for pivot
-- select pivot columns as dimensions and
-- value columns as measures from sql tables
) AS TableAlias
PIVOT
( <aggregation function>(column for aggregation or measure column) -- MIN,MAX,SUM,etc FOR []
IN (
[first pivoted column], ..., [last pivoted column]
)
) AS PivotTableAlias
ORDER BY clause – optional二、PIVOT的使用例子
1. 静态PIVOT的用法
为演示,从NorthWind数据库中提取一些记录生成新的Orders表,然后使用PIVOT将行转换到列。
USE tempdb
GO
SELECT YEAR(OrderDate) AS [Year]
,CustomerID
,od.Quantity
INTO dbo.Orders
FROM NorthWind..Orders AS o
JOIN NorthWind..[Order Details] AS od
ON o.OrderID = od.OrderID
WHERE o.CustomerID IN ('BONAP','BOTTM','ANTON')
SELECT CustomerID
,[1996],[1997],[1998]
FROM dbo.Orders
PIVOT (
SUM(Quantity)
FOR [Year] IN ([1996],[1997],[1998])
)x
/*
TSQL中pivot的结构:
● 用于生成pivot数据源的源表,作为一个输入表
● pivot表
● 聚合列及透视列的选择 TSQL中pivot的实现:
1->上例中Orders表相当于是一个输入表。包含了CustomerID,[Year],Quantity 三个列。
Year是透视列,用于生成维度。
pivot首先将聚合列之外的列进行分组,并对其实现聚合。本列中则是对聚合列Quantity之外的列先实现分组,
即对CustomerID,Year进行分组,并对其Quantity实现聚合,相当于先做如下处理:
*/
SELECT CustomerID
,[Year]
,SUM(Quantity) AS Total
FROM dbo.Orders
GROUP BY CustomerID
,[Year]
ORDER BY CustomerID /* Result:
CustomerID Year Total
---------- ----------- -----------
ANTON 1996 24
ANTON 1997 295
ANTON 1998 40
BONAP 1996 181
BONAP 1997 486
BONAP 1998 313
BOTTM 1996 81
BOTTM 1997 454
BOTTM 1998 421
*/
/*
2->pivot根据FOR [Year] IN子句中的值,在结果集中来建立对应的新列,本例中即是列,,
对于新列,,中的取值,取中间结果集中与之相对应的值。
如对于客户ANTON,1996列中的值就选择中间结果中对应的Total值,同理列中为。
并将中间结果pivot表命名为x。 3->最外层的SELECT语句从pivot表生成最终结果,此处因Orders表仅有列,故直接将结果用一个SELECT返回,有嵌套的SELECT参照下例。 --结果:
CustomerID 1996 1997 1998
---------- ----------- ----------- -----------
ANTON 24 295 40
BONAP 181 486 313
BOTTM 81 454 421
*/ 以下是为输入表多于一列的例子,数据来源于SQL server 2005的AdventureWorks,其实现的原理同上。
SELECT *
FROM(
SELECT YEAR(DueDate) [Year]
,CASE MONTH(DueDate)
WHEN 1 THEN 'January'
WHEN 2 THEN 'February'
WHEN 3 THEN 'March'
WHEN 4 THEN 'April'
WHEN 5 THEN 'May'
WHEN 6 THEN 'June'
WHEN 7 THEN 'July'
WHEN 8 THEN 'August'
WHEN 9 THEN 'September'
WHEN 10 THEN 'October'
WHEN 11 THEN 'November'
WHEN 12 THEN 'December'
END as [Month]
,ProductID
,OrderQty
FROM Production.WorkOrder
)WorkOrder
PIVOT (
SUM(OrderQty)
FOR [Month] IN ([January],[February],[March],[April],[May],[June],[July],[August],[September],[October],[November],[December])
)x
ORDER BY [Year], ProductID
--Result: 末尾部分省略
/*
Year ProductID January February March April May June July August
----------- ----------- ----------- ----------- ----------- ----------- ----------- ----------- ----------- -----------
2002 3 8480 16870 12960 9530 19390 14170 26200 35870
2002 316 1842 3704 2910 2252 4738 3496 7624 10778
2002 324 1842 3704 2910 2252 4738 3496 7546 10600
2002 327 921 1852 1455 1126 2369 1748 3773 5300
2002 328 414 1048 872 458 1272 992 1786 2632
*/
2. 动态PIVOT的使用
USE AdventureWorks;
GO --第一种生成透视列的方法,使用了COALESCE来联接字符串
DECLARE @PivotColHeader VARCHAR(MAX)
SELECT @PivotColHeader =
COALESCE(@PivotColHeader + ',[' + cast(Name as varchar) + ']',
'[' + cast(Name as varchar) + ']') --示例中Name转换为varchar或char类型,注意:在CAST 和CONVERT 中使用varchar 时,显示n的默认值为30
FROM Sales.SalesTerritory
GROUP BY Name /*
--第二种生成透视列的方法,使用了FOR XML PATH方法
SELECT @PivotColHeader =
STUFF(
(
SELECT DISTINCT ',[' + cast(Name as varchar) + ']'
FROM Sales.SalesTerritory
FOR XML PATH('')
),
1,1,'')
*/ DECLARE @PivotTableSQL NVARCHAR(MAX)
SET @PivotTableSQL = N'
SELECT *
FROM (
SELECT YEAR(H.OrderDate) [Year]
,T.Name
,H.TotalDue
FROM Sales.SalesOrderHeader H
LEFT JOIN Sales.SalesTerritory T
ON H.TerritoryID = T.TerritoryID
)AS PivotData
PIVOT(
SUM(TotalDue)
FOR Name IN (
' + @PivotColHeader + '
)
) AS x '
EXECUTE sp_executesql @PivotTableSQL --Result:部分结果省略
/*
Year Australia Canada Central France Germany Northeast
----------- --------------------- --------------------- --------------------- --------------------- --------------------- ---------------------
2001 1446497.1744 2173647.1453 1263884.1024 199531.723 262752.4184 754833.2045
2002 2380484.8387 7215430.5017 3518185.4756 1717145.7439 575960.0974 3275322.1694
2003 4547123.2777 8186021.9178 4015356.874 4366078.3475 2714826.4297 3833030.25
2004 3823410.2386 3926712.8926 1771532.7396 2853948.6596 2386224.5508 1406555.6861 */
对该动态pivot增加汇总列
DECLARE @PivotColHeader VARCHAR(MAX)
DECLARE @TotalCol VARCHAR(MAX) SELECT @PivotColHeader = --使用COALESCE函数生成列标题
COALESCE(@PivotColHeader + ',[' + cast(Name as varchar) + ']',
'[' + cast(Name as varchar) + ']')
,
@TotalCol = COALESCE(@TotalCol + ', SUM([' + cast(Name as varchar) + ']) AS [' + cast(Name as varchar) + ']'
,'SUM([' + cast(Name as varchar) + ']) AS [' + cast(Name as varchar) + ']') --使用COALESCE函数生成汇总字符串
FROM Sales.SalesTerritory DECLARE @PivotTableSQL NVARCHAR(MAX)
SET @PivotTableSQL = N'
SELECT *
FROM (
SELECT CAST(YEAR(H.OrderDate) AS CHAR(4)) [Year]
,T.Name
,H.TotalDue
FROM Sales.SalesOrderHeader H
LEFT JOIN Sales.SalesTerritory T
ON H.TerritoryID = T.TerritoryID
)AS PivotData
PIVOT(
SUM(TotalDue)
FOR Name IN (
' + @PivotColHeader + '
)
) AS x
UNION
SELECT ''GrandTotal'', ' + @TotalCol + '
FROM (
SELECT CAST(YEAR(H.OrderDate) AS CHAR(4)) [Year]
,T.Name
,H.TotalDue
FROM Sales.SalesOrderHeader H
LEFT JOIN Sales.SalesTerritory T
ON H.TerritoryID = T.TerritoryID
) AS PivotData
PIVOT(
SUM(TotalDue)
FOR Name IN (
' + @PivotColHeader + '
)
) AS y '
--PRINT @PivotTableSQL
EXECUTE sp_executesql @PivotTableSQL --Result:部分结果省略
/*
Year Australia Canada Central France Germany Northeast Northwest
---------- --------------------- --------------------- --------------------- --------------------- --------------------- --------------------- ---------------------
2001 1446497.1744 2173647.1453 1263884.1024 199531.723 262752.4184 754833.2045 2703481.7947
2002 2380484.8387 7215430.5017 3518185.4756 1717145.7439 575960.0974 3275322.1694 5651688.6685
2003 4547123.2777 8186021.9178 4015356.874 4366078.3475 2714826.4297 3833030.25 7494658.0357
2004 3823410.2386 3926712.8926 1771532.7396 2853948.6596 2386224.5508 1406555.6861 4952772.2793
GrandTotal 12197515.5294 21501812.4574 10568959.1916 9136704.474 5939763.4963 9269741.31 20802600.7782
*/
生成汇总列的注意事项;
1->使用COALESCE函数生成列标题 。
2->使用COALESCE函数生成带有SUM求和函数并且指定了别名的字符串。
3->使用UNION对两个SELECT来实现联接。且将[Year]转换为字符串,因为YEAR(H.OrderDate)得值为 INT ,而''GrandTotal''为字符串,UNION 或UNION ALL使用时必须列的数量和类型相对应。