How to fight dataset biases with synthetic data generation?

Tenet, C. Nolan (2020). Wondering if they filled in the accident report correctly after this 🤔

Checkbox generation with GANs?

What do these people, cats and artworks have in common? None of them exist! They were all fully generated by GAN models.
Should we open ?

The good old handcrafted generation method

Some examples of synthetic marks generated by the described method. Observe the impact of the elastic distortion (from top to bottom).
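To give a feel for what elastic distortion does to a mark, here is a minimal sketch in plain Python. It is not the generation method from this article: it only illustrates the general idea of moving each pixel along a smoothed random displacement field. All names (`make_cross`, `elastic_distort`, `alpha`, `sigma`) are hypothetical, the mark is a simple cross, and the parameter values are arbitrary assumptions.

```python
import math
import random


def make_cross(size):
    """Hypothetical stand-in for a hand-drawn mark: an X on a size x size grid."""
    image = [[0.0] * size for _ in range(size)]
    for i in range(size):
        image[i][i] = 1.0
        image[i][size - 1 - i] = 1.0
    return image


def gaussian_kernel(sigma):
    """Normalised 1-D Gaussian kernel covering roughly +/- 3 sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def smooth(field, sigma):
    """Blur a 2-D list of floats with a separable Gaussian (edges clamped)."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    height, width = len(field), len(field[0])

    def clamp(i, n):
        return min(max(i, 0), n - 1)

    # Horizontal pass, then vertical pass.
    rows = [[sum(k * field[y][clamp(x + j - radius, width)]
                 for j, k in enumerate(kernel))
             for x in range(width)] for y in range(height)]
    return [[sum(k * rows[clamp(y + j - radius, height)][x]
                 for j, k in enumerate(kernel))
             for x in range(width)] for y in range(height)]


def bilinear(image, y, x):
    """Sample the image at a fractional position; outside counts as background."""
    height, width = len(image), len(image[0])
    y0, x0 = math.floor(y), math.floor(x)
    fy, fx = y - y0, x - x0
    value = 0.0
    for dy, wy in ((0, 1 - fy), (1, fy)):
        for dx, wx in ((0, 1 - fx), (1, fx)):
            yy, xx = y0 + dy, x0 + dx
            if 0 <= yy < height and 0 <= xx < width:
                value += wy * wx * image[yy][xx]
    return value


def elastic_distort(image, alpha, sigma, seed=None):
    """Warp the image with a smoothed random displacement field.

    alpha scales the displacement (0 leaves the image unchanged);
    sigma controls how smooth the displacement field is.
    """
    rng = random.Random(seed)
    height, width = len(image), len(image[0])

    def noise():
        return [[rng.uniform(-1.0, 1.0) for _ in range(width)]
                for _ in range(height)]

    dx = smooth(noise(), sigma)
    dy = smooth(noise(), sigma)
    return [[bilinear(image, y + alpha * dy[y][x], x + alpha * dx[y][x])
             for x in range(width)] for y in range(height)]


if __name__ == "__main__":
    mark = make_cross(16)
    # Arbitrary strengths, printed from top to bottom as rough ASCII art.
    for alpha in (0.0, 10.0, 25.0):
        warped = elastic_distort(mark, alpha=alpha, sigma=2.0, seed=42)
        print(f"alpha = {alpha}")
        for row in warped:
            print("".join("#" if v > 0.5 else "." for v in row))
```

Setting `alpha` to 0 returns the mark unchanged, and larger values bend the strokes more. For real images, array libraries such as SciPy (`scipy.ndimage.gaussian_filter`, `scipy.ndimage.map_coordinates`) would likely be a faster way to build the same kind of warp.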

Going further: Data Generation for OCR tasks

On the left: real samples. In the middle: fake samples classified as realistic. On the right: fake samples classified as non-credible.



