100万条数据的表如何快速随机抽取10条数据

select top 10 * from tablename order by newid()
这个速度太慢了，要等半分钟。有没有快速点的办法。

表的主键是varchar类型的，主键是字符串而不是int

11 个解决方案

#1

set nocount on--75151448行数据
select count(*) from syscolumns a,syscolumns b,syscolumns c 

set statistics time on 

select
 top 10 * from(select a.name as name,left(b.name,10) as name1, checksum(newid()) as hash    from syscolumns a,syscolumns b,syscolumns c )t 
where
 hash%100=0

set statistics time off

set nocount off

#2

数据量太大，每次都order by newid()效率上确实会有问题。

#3

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

#4

直接 top 10 ,不用order by 应该最快了

#5

引用 4 楼 hyrongg 的回复:

直接 top 10 ,不用order by 应该最快了

要随机抽取

#6

引用 3 楼 wufeng4552 的回复:

SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

#7

引用 6 楼 wxz280973534 的回复:

引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

#8

引用 6 楼 wxz280973534 的回复:

引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

既然是随即
我觉得可以没必要在100W里随即
可以在前10或者20W里
再查随即看看

#9

引用 1 楼 fredrickhu 的回复:

SQL code

set nocount on--75151448行数据
select count(*) from syscolumns a,syscolumns b,syscolumns c

set statistics time on

select
top 10 * from(select a.name as name,left(b.name,10) as name……

这个是什么意思啊？看不懂，太深奥了，
我的比如表名是 T1列名是C1，要怎么写sql语句

#10

引用 7 楼 wufeng4552 的回复:

引用 6 楼 wxz280973534 的回复:
引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

2005的数据库

#11

引用 10 楼 wxz280973534 的回复:

引用 7 楼 wufeng4552 的回复:
引用 6 楼 wxz280973534 的回复:
引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

20……

那应该没语法错误
看看你数据库的兼容级别是不是80的了
如果是80的就有语法错误

#1

set nocount on--75151448行数据
select count(*) from syscolumns a,syscolumns b,syscolumns c 

set statistics time on 

select
 top 10 * from(select a.name as name,left(b.name,10) as name1, checksum(newid()) as hash    from syscolumns a,syscolumns b,syscolumns c )t 
where
 hash%100=0

set statistics time off

set nocount off

#2

数据量太大，每次都order by newid()效率上确实会有问题。

#3

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

#4

直接 top 10 ,不用order by 应该最快了

#5

引用 4 楼 hyrongg 的回复:

直接 top 10 ,不用order by 应该最快了

要随机抽取

#6

引用 3 楼 wufeng4552 的回复:

SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

#7

引用 6 楼 wxz280973534 的回复:

引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

#8

引用 6 楼 wxz280973534 的回复:

引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

既然是随即
我觉得可以没必要在100W里随即
可以在前10或者20W里
再查随即看看

#9

引用 1 楼 fredrickhu 的回复:

SQL code

set nocount on--75151448行数据
select count(*) from syscolumns a,syscolumns b,syscolumns c

set statistics time on

select
top 10 * from(select a.name as name,left(b.name,10) as name……

这个是什么意思啊？看不懂，太深奥了，
我的比如表名是 T1列名是C1，要怎么写sql语句

#10

引用 7 楼 wufeng4552 的回复:

引用 6 楼 wxz280973534 的回复:
引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

2005的数据库

#11

引用 10 楼 wxz280973534 的回复:

引用 7 楼 wufeng4552 的回复:
引用 6 楼 wxz280973534 的回复:
引用 3 楼 wufeng4552 的回复:
SQL code

--try
select top 10 *
from tablename tablesample(1000 rows)
order by newid()

说rows附近有语法错误

你是2000的

20……

那应该没语法错误
看看你数据库的兼容级别是不是80的了
如果是80的就有语法错误

秒客网

100万条数据的表如何快速随机抽取10条数据

11 个解决方案

#1

#2

#3

#4

#5

#6

#7

#8

#9

#10

#11

#1

#2

#3

#4

#5

#6

#7

#8

#9

#10

#11

相关文章