Hi. I have a MySQL database with 1,000,000 podcast episodes.
Currently I am labelling the episodes by topic, e.g. "India", "Machine Learning" or "True Crime". The system runs a simple full-text query per topic, for example MATCH(title, description) AGAINST ('India'), and stores the results in a cached results table, e.g. (episodeid, topicid, match_strength).
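Roughly, the per-topic labelling query looks like the sketch below. The table names (episodes, episode_topics), the id column and the topic id 42 are illustrative placeholders, not our actual schema. It assumes a FULLTEXT index on (title, description) and a unique key on (episodeid, topicid).

```sql
-- Illustrative sketch only: table names, e.id and topic id 42 are placeholders.
-- Assumes FULLTEXT(title, description) on episodes
-- and UNIQUE(episodeid, topicid) on episode_topics.
INSERT INTO episode_topics (episodeid, topicid, match_strength)
SELECT
    e.id,
    42,  -- hypothetical topicid for the topic "India"
    MATCH(e.title, e.description) AGAINST ('India' IN NATURAL LANGUAGE MODE)
FROM episodes AS e
-- Only episodes with a non-zero relevance score are kept.
WHERE MATCH(e.title, e.description) AGAINST ('India' IN NATURAL LANGUAGE MODE)
-- Re-running the job refreshes the stored score instead of failing on duplicates.
ON DUPLICATE KEY UPDATE match_strength = VALUES(match_strength);
```

One statement like this has to run for every topic, on every refresh.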
While this is easy, it's slow: the matches are too slow to run on demand, so I have to build the cached results before I can access them. Once we have 500+ topics, it will take multiple cron jobs to keep the database updated regularly.
My end goal is to catalogue all the podcasts under topics.
I’m looking for a more elegant method to do this, especially as we go from 1m to 10m episodes. Interested to chat?
Our system is built on PHP and MySQL, so all solutions will need to reflect this. Thanks.