What's the biggest mistake people make when checking synthetic data for bias?
Treating synthetic data bias detection the same as real data auditing. With synthetic data, you're auditing the generator's algorithm and your own assumptions, not just finding societal biases captured in historical records. People often forget to check for 'mode collapse,' where the generator produces limited variation, creating a false sense of security.