Appjeniksaan

Emoji Dataset

For a project, I was looking for a relatively small dataset of all emojis in a workable format. I was able to find a few examples online, but all of them had some issues. Therefore, I created my own parser that goes through the Unicode list and creates a JSON file of around 60KB with all emojis, including their names for searchability.

The format of the file is quite minimal due to the overhead of the JSON format and looks like this:

{
    "g": "Smileys & Emotion", // Group Name
    "e": [
        ["😀", "grinning face"],
        ["😃", "grinning face with big eyes"],

The parser bundles the base emoji with the skin tone variants which are included in the following format:

["👋", "waving hand", ["👋🏻", "👋🏼", "👋🏽", "👋🏾", "👋🏿"]],

The Github project contains the JSON output of the parser, ready for use.