INF: Semantics of SELECT DISTINCT with ORDER BY

Last reviewed: September 9, 1996
Article ID: Q125324
The information in this article applies to:
  • Microsoft Open Database Connectivity, version 2.0

SUMMARY

The information in this article applies to ODBC Desktop Database Drivers version 2.0. A SELECT clause that has the DISTINCT modifier in the select list and has an ORDER BY and the ORDER BY clause has columns that are not present in the select list, causes an error due to ambiguous semantics. The article discusses why this occurs.

MORE INFORMATION

ODBC Desktop Database drivers are single-tier drivers. They are written as a thin layer on top of the JET database engine. The JET engine is the SQL engine that Microsoft Access and Microsoft Visual Basic use to get to dBASE, Paradox, FoxPro, Btrieve, Access, Excel, and Text file formats.

When a SELECT statement is executed, the JET engine always performs an implicit sort if a DISTINCT modifier is present; an ORDER BY clause causes another sort to be done, which occurs after all other sorts (if any).

As an extension to ANSI 89 SQL, the JET engine allows you to ORDER BY columns that do not appear in the SELECT list. Thus, a SELECT statement of the form

   SELECT x FROM test_table
   ORDER BY y

is a valid SQL statement. On this statement, the engine does a sort on the result set and throws out the columns that are not selected.

However, the semantics of a statement of the form

   SELECT DISTINCT x FROM test_table
   ORDER BY y

are non-deterministic. The engine does an implicit sort for the SELECT DISTINCT. If it has the pairs (x0,y1),(x0,y2), one of them has to go because of the DISTINCT x. But which one should go?

The decision that the engine makes will affect where in the sort order (x0) appears, because the sort for the ORDER BY clause is always done last. Because of the non-determinism in the semantics of a SELECT statement, when you try to execute it, you will generate the error:

   [Microsoft][ODBC Microsoft Access 2.0 Driver] ORDER BY clause
   (y) conflicts with DISTINCT
   SQLSTATE = "22005", NativeError = -3015

Note that what you want might be obtained by the following query:

   SELECT DISTINCT x,y FROM test_table
   ORDER BY y

This will cause duplicate values of x that you would not otherwise see had you not included y in the select list. However, in most cases, this is a very acceptable workaround.


Additional reference words: 2.00.2317 dBASE Paradox Btrieve Access
FoxPro Text Excel SQL 3015 Windows NT
KBCategory: kbprg
KBSubcategory:



THE INFORMATION PROVIDED IN THE MICROSOFT KNOWLEDGE BASE IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND. MICROSOFT DISCLAIMS ALL WARRANTIES, EITHER EXPRESS OR IMPLIED, INCLUDING THE WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT SHALL MICROSOFT CORPORATION OR ITS SUPPLIERS BE LIABLE FOR ANY DAMAGES WHATSOEVER INCLUDING DIRECT, INDIRECT, INCIDENTAL, CONSEQUENTIAL, LOSS OF BUSINESS PROFITS OR SPECIAL DAMAGES, EVEN IF MICROSOFT CORPORATION OR ITS SUPPLIERS HAVE BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. SOME STATES DO NOT ALLOW THE EXCLUSION OR LIMITATION OF LIABILITY FOR CONSEQUENTIAL OR INCIDENTAL DAMAGES SO THE FOREGOING LIMITATION MAY NOT APPLY.

Last reviewed: September 9, 1996
© 1998 Microsoft Corporation. All rights reserved. Terms of Use.