all groups > sql server odbc > october 2003 >
You're in the

sql server odbc

group:

SubQuerying Vs Joining


SubQuerying Vs Joining Awah Teh
10/29/2003 3:49:19 PM
sql server odbc:
Which is most effective (consider the query below for an example)?
Please give me some explanation (FMI:) as to why one is faster over the
other.


--Both Queries are designed to get the users information of users that
received passes in the Year of 2003

--** ***************
--** QUERY 1
--** ***************
Select * from users where user_id in (select user_id from
users_site_passes where date_pass_issued >= 'January 1, 2003')


--** ***************
--** QUERY 2
--** ***************
Select users.* from users, users_site_passes where users.user_id =
users_site_passes.user_id and users_site_passes.date_pass_issued >= 'January
1, 2003'

Thanks In Advance
A-

--
Awah Teh
Chief Executive Officer
DigicentriQ Technologies, LLC
awaht@digicentriq.com
www.digicentriq.com
877 675 4742
805 732 9421

Re: SubQuerying Vs Joining Rich Dillon
10/29/2003 4:50:16 PM
Awah,

I assume that you have a one-to-many relationship here; that <user_id> is a
key for <users> and that <users_site_passes> may have several rows for any
one user. In that case your queries aren't equivelant in the first place.
These three are.

SELECT *
FROM users
WHERE user_id IN (
SELECT user_id
FROM users_site_passes
WHERE date_pass_issued >= '2003-01-01');

SELECT *
FROM users AS u
WHERE EXISTS (
SELECT *
FROM users_site_passes
WHERE user_id=u.user_id AND date_pass_ussued >= '2003-01-01');

SELECT DISTINCT u.*
FROM users AS u JOIN users_site_passes AS p ON u.user_id=p.user_id
WHERE p.date_pass_issued >= '2003-01-01';

Since these three queries define the same result, an ideal optimizer would
produce the same execution plan for each. There's no good reason that it
should produce a better plan for one than for the others. In practice, SQL
Server often does. Have a look at the execution plans in QA to see which
works out best in your environment. Keep in mind, though, that the
difference you see isn't the necessary result of any law of nature but,
rather, an accident of the current state of SQL Server development and of
the particulars of your environment.


Hope that helps,
Rich


[quoted text, click to view]

Re: SubQuerying Vs Joining Andrew J. Kelly
10/29/2003 7:01:48 PM
Awah,

First off the one that is most likely to be fastest is not shown. This
would be an EXISTS statement.

Select * from users AS b where EXISTS (select * from
users_site_passes AS a where a.User_id = b.User_ID AND a.date_pass_issued
[quoted text, click to view]

Exists will stop looking after the first match is found where as the other
two might do more work. To answer your original question though they may in
fact be the same. Sometimes the optimizer will create similar plans on
queries such as these. Here are some general comments though:

Get in the habit of using the ANSI JOIN syntax like this:

Select users.* from users INNER JOIN users_site_passes
ON users.user_id = users_site_passes.user_id and
users_site_passes.date_pass_issued

When dealing with DATE strings you should use the ANSI syntax as well to
avoid issues where sql server can misinterpret the date. It goes like
this: 'yyyymmdd' Always has 8 chars and no dashes, slashes etc.



--

Andrew J. Kelly
SQL Server MVP


[quoted text, click to view]

AddThis Social Bookmark Button