In theory, ChatGPT3 or the upcoming ChatGPT4 can be very good at processing scraped data and comparing it against the existing game entries at UVL, so that the majority of the extraneous data will be correctly excluded. And where it does make mistakes, it can be told exactly what it's mistake was and it is unlikely to repeat that same mistake ( if the correction is phrased correctly ). Of course for that level of training you would need two things. Premium (paid for) use of ChatGPT3 and a team of trainers that understands how ChatGPT3 thinks. Note, while ChatGPT3 can provide suggestions on the best way that a trainer should correct it, it does not fully understand itself well enough that this tactic is a reliable way to train trainers. (Or so chat GPT tells me.) When it comes to suggesting how trainers can best correct it, ChatGPT3 can only offer an alternate perspective on the best way, not necessarily a correct one.
Also note, when I talk about chatGPT as if it's self-aware, I merely imply to mean that it simulates certain aspects of self-awareness in many situations. On a technical level it doesn't "understand" anything, but you could ask it about fictional subjects and it will give you information about that subject in the context that is fiction (as if it understands the difference between fiction and reality).
I asked ChatGPT3 specifically about this several different ways and dozens of times. I gave it some sample web pages to look at from UVL and seven other game database sites. And ask for an estimate to compare the seven sites and enter or correct 75,000 games using the UVL API. Estimates range from 268 days exactly to 281 days and 4 hours and 8 minutes and 31 seconds for a specifically trained version of ChatGPT3. ChatGPT4 would take longer but be more accurate.