How to fight the dataset biases with synthetic data generation?

Tenet, C. Nolan (2020). Wondering if they filled the accident report correctly after this 🤔

Checkboxes generation with GAN’s?

What do these persons, cats and artworks hold in common? None of them exist! They have been fully generated with some GAN models
Should we open thischeckboxdoesntexist.com ?

The good old handcrafted generation method

Some examples of synthetic marks generated by the described method. Observe the impact of the elastic distortion (from top to bottom)

Going further: Data Generation for OCR tasks

On the left: real samples. In the middle: fake samples classified as realistic. On the right: fake samples classified as non-credible.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store