Nice approach! The latter part of that workflow is where we are focused now, i.e. now that we can plan and recommend B-tree indexes for individual queries, how do we utilize this to recommend a set of indexes for a whole database (taking into account different predicate variations).
On open-sourcing this code: Not at this moment - we do, however, offer a WASM build of this logic for free use at https://pganalyze.com/index-advisor
> The latter part of that workflow is where we are focused now, i.e. now that we can plan and recommend B-tree indexes for individual queries, how do we utilize this to recommend a set of indexes for a whole database
what I've ended up doing is approximating the number of rows, creating dummy data with a similar spread, and then creating indexes and querying until I have a good understanding of exactly how each index behaves at that scale. It ends up being a lot of work, but sometimes a 50% performance increase is worth it overall.
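for anyone wanting to try the same thing, here's a minimal sketch of that approach in plain SQL - the table, column names, row count, and skew are made up for illustration, not from any real schema, and you'd tune them to roughly match your production data:

```sql
-- Hypothetical table standing in for a production one.
CREATE TABLE orders_test (
    id          bigint GENERATED ALWAYS AS IDENTITY,
    customer_id bigint,
    status      text,
    created_at  timestamptz
);

-- ~1M rows: power(random(), 3) skews customer_id so a few customers
-- dominate, which is closer to many real workloads than a uniform spread.
INSERT INTO orders_test (customer_id, status, created_at)
SELECT
    (power(random(), 3) * 100000)::bigint,
    (ARRAY['shipped', 'pending', 'cancelled'])[1 + (random() * 2.2)::int],
    now() - random() * interval '365 days'
FROM generate_series(1, 1000000);

ANALYZE orders_test;

-- Compare plan and timing before and after a candidate index.
EXPLAIN ANALYZE
SELECT count(*) FROM orders_test
WHERE customer_id = 42 AND status = 'pending';

CREATE INDEX ON orders_test (customer_id, status);

EXPLAIN ANALYZE
SELECT count(*) FROM orders_test
WHERE customer_id = 42 AND status = 'pending';
```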
> On open-sourcing this code: Not at this moment
completely understandable - the selfish side of me hopes it will be open-sourced eventually, but the practical side of me sees the benefit to your company of keeping it private for now.
those also require real or simulated data to get good recommendations, which is sometimes hard. if you're going to go that far, you might as well create real indexes.
besides, in some environments access to the real data (if there's PII) is much more difficult to get, so simulating is as far as you can go.
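one trick that can help there (just a sketch, the table/column names are placeholders): instead of copying rows out, pull the planner statistics for the sensitive columns from pg_stats and use the null fraction, distinct count, and frequency arrays to drive synthetic data generation. note that most_common_vals and histogram_bounds can still contain real values, so in strict environments you may only be able to take the counts and frequencies:

```sql
-- Planner statistics for a hypothetical table with sensitive columns.
-- null_frac, n_distinct and most_common_freqs are often enough to
-- reproduce the cardinality and skew without exporting the rows themselves.
SELECT attname,
       null_frac,
       n_distinct,
       most_common_vals,
       most_common_freqs,
       histogram_bounds
FROM pg_stats
WHERE schemaname = 'public'
  AND tablename  = 'customers';
```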
* log the queries from the application
* parse the queries into an AST
* collate the ASTs to form the most common query patterns
* cross-reference existing indexes
* create index recommendations with and without predicates
that worked pretty well, right up until you have to decide which index type is going to be best.
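fwiw, the collate and cross-reference steps can be roughed out in SQL alone if pg_stat_statements is available - its built-in query normalization stands in for the AST parsing step above. this is only a sketch: the column names are from Postgres 13+ (older versions use mean_time), and the seq-scan count is just a crude heuristic for "missing index":

```sql
-- Most common query patterns, already collated by pg_stat_statements.
SELECT queryid, calls, mean_exec_time, query
FROM pg_stat_statements
ORDER BY calls DESC
LIMIT 20;

-- Cross-reference: tables that are mostly sequentially scanned, together
-- with the indexes they already have, are the first candidates to look at.
SELECT s.schemaname,
       s.relname,
       s.seq_scan,
       s.idx_scan,
       array_agg(i.indexdef) FILTER (WHERE i.indexdef IS NOT NULL) AS existing_indexes
FROM pg_stat_user_tables s
LEFT JOIN pg_indexes i
       ON i.schemaname = s.schemaname
      AND i.tablename  = s.relname
GROUP BY s.schemaname, s.relname, s.seq_scan, s.idx_scan
ORDER BY s.seq_scan DESC
LIMIT 20;
```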
this is pretty darned neat in comparison. I do hope they open-source it like they did pg_query.