OpenAI launched its latest model only to its top-tier (highest-paying) users. In pre-launch tests, the model achieved a certain answer-accuracy score. However, analysis of the post-launch data showed a lower score. Why might that be, and how could it be fixed?
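One candidate explanation is that top-tier users send a different mix of queries than the pre-launch test set contained. The sketch below illustrates the arithmetic only, and every name and number in it is a hypothetical assumption rather than real data: the segments `simple` and `complex`, their per-segment accuracies, and both query mixes are invented for illustration. It shows that per-segment accuracy can stay the same while the overall score drops, purely because the population shifted.

```python
from typing import Dict


def mix_weighted_accuracy(
    segment_accuracy: Dict[str, float],
    segment_share: Dict[str, float],
) -> float:
    """Overall accuracy as the share-weighted mean of per-segment accuracy."""
    total_share = sum(segment_share.values())
    if total_share <= 0:
        raise ValueError("segment shares must sum to a positive number")
    weighted_sum = sum(
        segment_accuracy[segment] * share
        for segment, share in segment_share.items()
    )
    return weighted_sum / total_share


# Hypothetical per-segment accuracy, assumed identical before and after launch.
segment_accuracy = {"simple": 0.95, "complex": 0.70}

# Hypothetical query mixes: the pre-launch test set skews toward simple
# queries, while top-tier users are assumed to skew toward complex ones.
prelaunch_mix = {"simple": 0.8, "complex": 0.2}
top_tier_mix = {"simple": 0.3, "complex": 0.7}

prelaunch_score = mix_weighted_accuracy(segment_accuracy, prelaunch_mix)
top_tier_score = mix_weighted_accuracy(segment_accuracy, top_tier_mix)

print(f"pre-launch score: {prelaunch_score:.3f}")
print(f"top-tier score:   {top_tier_score:.3f}")
```

If this explanation holds, one possible remedy is to reweight or resample the pre-launch evaluation set so that its mix matches the launch audience, and then compare scores segment by segment instead of in aggregate.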