Discussion about this post

User's avatar
Aleks Tiupikov's avatar

That's a very interesting topic. My biggest concern is how to automate the generation of metadata. From my experience working on text-to-SQL, the most helpful metadata is usually generated by humans.

One potential solution might be to use existing query pairs that a company already has (like revenue - query X, retention - query Y). You could feed these to the AI and ask it to generate column metadata based on that. However, I'm not sure how accurate that would be if the same column is used for different meanings. Who's going to decide what's the right definition?

Curious to hear your thoughts!

Expand full comment
1 more comment...

No posts

Ready for more?