I built a Pikachu classifier because there's way too many of them.

Open In Colab

Creating over a thousand distinct creatures is pretty hard, right? I’ve always said that. At some point, it’s pretty understandable that GameFreak would have some repeated archetypes over 20yrs.

And for some reason, Electric Rodent is #1.

Depending on who you ask, there are upwards of a dozen entirely seperate ‘mons that fit into the Pikachu Family, a fan term for the litany of all-too-similar-looking mice, rats, squirrels, hedgehogs, and gerbils that (adorably) infest the Pokémon world.

Of course, this presents a distinct challenge for anyone who doesn’t want to take take a college course on poké-taxonomy and just wants to know which ‘mon they saw a cute plushie for at Target.

To that end, I built a Pikachu classifier because there’s way too many of them.

It covers how to set up an Azure API instance to scrape the images, cleaning the data for consumption (removing transparency, corrupt files), transforming the images (setting dataloaders, resizing, adding image transforms, etc), building and evaluating a CNN against the data (spoiler: it does pretty well), and finally pickling the model for later inference, I even added a little UI at the end where you can add your own images to see how it does.