Update 'How fast are really the analytical DBMS?'

Guillem Borrell Nogueras 2023-02-19 09:19:25 +01:00
parent 4e3a7415aa
commit 8ef9c5a3ad

@ -10,6 +10,32 @@ I did a small experiment with a synthetic dataset I created for the PyCon Spain
![Table](https://git.guillemborrell.es/guillem/blog/raw/branch/main/images/analyticdbms/Screenshot_20230212_215503_Chrome.jpg) ![Table](https://git.guillemborrell.es/guillem/blog/raw/branch/main/images/analyticdbms/Screenshot_20230212_215503_Chrome.jpg)
The query is designed to crash naive query engines, with a CTE and two nested aggregations:
```sql
with by_nplayers as (
SELECT
max(toss) as toss,
count(*) as qty,
count(*) as nplayers,
game
FROM
boards
GROUP BY
game
)
select
sum(qty) as qty,
toss,
nplayers
from
by_nplayers
group by
toss, nplayers
order by
nplayers, toss desc
```
Here are the results for postgresql Here are the results for postgresql
![Postgresql timings](https://git.guillemborrell.es/guillem/blog/raw/branch/main/images/analyticdbms/Screenshot_20230212_213038_Chrome.jpg) ![Postgresql timings](https://git.guillemborrell.es/guillem/blog/raw/branch/main/images/analyticdbms/Screenshot_20230212_213038_Chrome.jpg)