
Question

Asked: 2020-01-21 12:15:26 +0800 CST

Performance difference when querying with SELECT DISTINCT and GROUP BY?


I am reviewing and learning SQL, and I noticed something that seems curious to me.

Suppose I have a table called productos and one of its fields is categoria. When I run the following queries, I see that the result is the same:

SELECT DISTINCT categoria FROM productos;

and

SELECT categoria FROM productos GROUP BY categoria;

The difference I notice is that DISTINCT filters out the duplicates and respects the order in which they appear, while the statement that uses GROUP BY arranges them in alphabetical order. Based on that, it could be said that the first statement executes faster. If so, when handling large volumes of data, would the difference in performance be considerable?

5 Answers


Leandro Tuttini · Answer 1 · 2020-01-21T12:36:29+08:00

Although both techniques clearly produce the same final result, they are not equally appropriate for what you want to achieve.

Given what you describe, the correct choice would be DISTINCT, since it applies to the row, whereas GROUP BY was created to work with aggregations such as SUM(), MAX(), AVG(), etc.

The order would not be a problem, because an ORDER BY would resolve the difference.
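As a minimal sketch of the points above, the following uses Python's built-in sqlite3 module with an in-memory database. The sample rows and the nombre column are made up for illustration; only productos and categoria come from the question. It shows that adding ORDER BY makes both queries return the same rows in the same order, and what GROUP BY looks like when it is used with an aggregate.

```python
import sqlite3

# Hypothetical sample data; only the names productos/categoria come from the question.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE productos (nombre TEXT, categoria TEXT)")
conn.executemany(
    "INSERT INTO productos VALUES (?, ?)",
    [("mesa", "muebles"), ("silla", "muebles"), ("lapiz", "oficina"), ("sofa", "muebles")],
)

# DISTINCT with an explicit ORDER BY: the order no longer depends on the engine.
distinct_rows = conn.execute(
    "SELECT DISTINCT categoria FROM productos ORDER BY categoria"
).fetchall()

# GROUP BY with the same ORDER BY returns the same rows in the same order.
group_rows = conn.execute(
    "SELECT categoria FROM productos GROUP BY categoria ORDER BY categoria"
).fetchall()

# GROUP BY used with an aggregate: one count per category.
counts = conn.execute(
    "SELECT categoria, COUNT(*) FROM productos GROUP BY categoria ORDER BY categoria"
).fetchall()

print(distinct_rows)  # [('muebles',), ('oficina',)]
print(group_rows)     # [('muebles',), ('oficina',)]
print(counts)         # [('muebles', 3), ('oficina', 1)]
```

Without ORDER BY, SQL does not guarantee any particular row order for either query, so the order seen in practice is a side effect of how the engine executed it.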

In these links, although they are in English, the same issue was raised:

Luis Suarez · Answer 2 · 2020-01-21T12:28:02+08:00

GROUP BY is used more for operations such as count, sum, etc.

Depending on the number of records in the table (talking about millions of records), the select (whether with distinct or with group by) will take more or less the same time.

If the table has millions of records (100, 200, 500), it is sometimes best to first extract the data you want to group into a temporary table (select ... insert) and then run the distinct or the group by on the temporary table. The query is then considerably faster.
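The temporary-table idea could be sketched roughly as follows, again with Python's sqlite3 module. The table name tmp_categorias, the extra columns and the sample rows are hypothetical. Whether this is actually faster depends on the engine, the indexes and the data, so it is worth measuring on the real table before adopting it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE productos (nombre TEXT, categoria TEXT, precio REAL)")
conn.executemany(
    "INSERT INTO productos VALUES (?, ?, ?)",
    [("mesa", "muebles", 120.0), ("silla", "muebles", 45.5), ("lapiz", "oficina", 0.8)],
)

# Step 1: copy only the column to be grouped into a temporary table.
# tmp_categorias is a hypothetical name chosen for this sketch.
conn.execute("CREATE TEMP TABLE tmp_categorias (categoria TEXT)")
conn.execute("INSERT INTO tmp_categorias SELECT categoria FROM productos")

# Step 2: run the DISTINCT (or GROUP BY) against the narrow temporary table.
from_temp = conn.execute(
    "SELECT DISTINCT categoria FROM tmp_categorias ORDER BY categoria"
).fetchall()

# For comparison: the same query run directly against the original table.
direct = conn.execute(
    "SELECT DISTINCT categoria FROM productos ORDER BY categoria"
).fetchall()

print(from_temp == direct)  # True
```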

Juan Ruiz de Castilla · Answer 3 · 2020-01-22T12:42:09+08:00

In addition to what Leandro says, here is a faithful translation of one of the answers in the link he attached. The answer varies between engines, but these two database engines give you an idea:

Answer:

There is no difference (in SQL Server, at least). Both queries use the same execution plan.

http://sqlmag.com/database-performance-tuning/distinct-vs-group

Perhaps there is a difference, if there are subqueries involved:

http://blog.sqlauthority.com/2007/03/29/sql-server-difference-between-distinct-and-group-by-distinct-vs-group-by/

No difference (Oracle-style):

http://asktom.oracle.com/pls/asktom/f?p=100:11:0::::P11_QUESTION_ID:32961403234212

original answer
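The quoted claims are about SQL Server and Oracle, and the simplest way to check any other engine is to ask it for its plan. As a sketch only, this is how it could be done in SQLite through Python's sqlite3 module, using SQLite's EXPLAIN QUERY PLAN statement. The helper name query_plan and the nombre column are assumptions. The wording of the plan differs between SQLite versions, and SQLite's result says nothing about what SQL Server or Oracle do.

```python
import sqlite3

def query_plan(conn, sql):
    """Return the human-readable steps of SQLite's plan for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    # The textual description is the last column of each plan row.
    return [row[-1] for row in rows]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE productos (nombre TEXT, categoria TEXT)")

plan_distinct = query_plan(conn, "SELECT DISTINCT categoria FROM productos")
plan_group_by = query_plan(conn, "SELECT categoria FROM productos GROUP BY categoria")

# Print both plans side by side to compare them by eye.
print("DISTINCT:", plan_distinct)
print("GROUP BY:", plan_group_by)
```

In SQL Server the equivalent check is to compare the execution plans of the two statements, and in Oracle to use EXPLAIN PLAN.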

Jairo1010 · Answer 4 · 2020-01-21T12:35:08+08:00

The DISTINCT keyword removes duplicate records; the GROUP BY clause is meant for grouping records.

DISTINCT is executed as follows:

Copy all business_key values to a temporary table
Sort the temporary table
Scan the temporary table, returning each element that is different from the previous one

GROUP BY is executed as follows:

Scan the full table, storing each business_key in a hash table
Return the keys of the hash table

The first optimizes memory, while the second optimizes speed but requires a large amount of memory depending on the number of keys.
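The two strategies described above can be sketched in plain Python. This is only an illustration of the steps as listed, with hypothetical function names and sample keys. A real engine's optimizer is free to pick a sort-based or a hash-based plan for either DISTINCT or GROUP BY.

```python
def distinct_by_sorting(keys):
    """Sort-based: copy, sort, then keep each element that differs from the previous one."""
    temp = sorted(keys)  # sorted() copies the input and sorts the copy
    result = []
    for key in temp:
        if not result or key != result[-1]:
            result.append(key)
    return result


def distinct_by_hashing(keys):
    """Hash-based: a single pass that stores each key in a hash table."""
    seen = {}
    for key in keys:
        seen[key] = True  # a dict keeps keys in first-insertion order
    return list(seen)


business_keys = ["b", "a", "b", "c", "a"]
print(distinct_by_sorting(business_keys))  # ['a', 'b', 'c']
print(distinct_by_hashing(business_keys))  # ['b', 'a', 'c']
```

The sort-based version returns the keys in sorted order, while the hash-based version here returns them in order of first appearance. The hash table has to hold every distinct key at once, which is where the memory cost comes from.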

Greetings.

AzidRain · Answer 5 · 2020-01-21T12:23:07+08:00

The first option simply filters the rows as it finds them, but it has to go through all of them to get the result. When you use group by, the initial result is processed again to sort it by the grouping value, in your case categoria. Without indexes, the first option is faster. However, if you put an index on the categoria field, the query with group by is almost as fast. Keep in mind that each alternative should be chosen according to the result you need.
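To see what an index on categoria changes, one option is to compare the plan of the group by query before and after creating it. The sketch below does this in SQLite through Python's sqlite3 module. The index name idx_productos_categoria, the helper plan_steps and the sample rows are assumptions. The plan text varies by SQLite version, and other engines may behave differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE productos (nombre TEXT, categoria TEXT)")
conn.executemany(
    "INSERT INTO productos VALUES (?, ?)",
    [("silla", "muebles"), ("lapiz", "oficina"), ("mesa", "muebles")],
)

GROUP_BY_SQL = "SELECT categoria FROM productos GROUP BY categoria"

def plan_steps(sql):
    """Return the textual steps of SQLite's plan (last column of each row)."""
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]

# Plan and result without any index on categoria.
plan_without_index = plan_steps(GROUP_BY_SQL)
rows_without_index = sorted(conn.execute(GROUP_BY_SQL).fetchall())

# idx_productos_categoria is a hypothetical index name.
conn.execute("CREATE INDEX idx_productos_categoria ON productos (categoria)")

# Plan and result once the index exists.
plan_with_index = plan_steps(GROUP_BY_SQL)
rows_with_index = sorted(conn.execute(GROUP_BY_SQL).fetchall())

print("without index:", plan_without_index)
print("with index:   ", plan_with_index)
print(rows_without_index == rows_with_index)  # True: the index changes the plan, not the result
```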
