How to fight the dataset biases with synthetic data generation?

Tenet, C. Nolan (2020). Wondering if they filled the accident report correctly after this 🤔

Checkboxes generation with GAN’s?

What do these persons, cats and artworks hold in common? None of them exist! They have been fully generated with some GAN models
Should we open ?

The good old handcrafted generation method

Some examples of synthetic marks generated by the described method. Observe the impact of the elastic distortion (from top to bottom)

Going further: Data Generation for OCR tasks

On the left: real samples. In the middle: fake samples classified as realistic. On the right: fake samples classified as non-credible.



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store