• skisnow@lemmy.ca
    3 days ago

    For a given individual, sure. If you’re trying to do some statistics over a whole group that you have no other record for, it could be useful.

    • bss03@infosec.pub
      3 days ago

      Sounds like that statistical output would be heavily biased by whatever process you were using to turn names into genders. In short, a bad idea.

      • TangledHyphae@lemmy.world
        3 days ago

        “Since the dataset isn’t 100% perfectly annotated for analysis, we should give up the whole project entirely.”

        • Shanmugha@lemmy.world
          2 days ago

          No, since the dataset is bound to give nonsensical results, we search for sources that are more precise. Hint: the already-mentioned “Andrea”, and Japanese names.
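
To make the bias concern in this thread concrete, here is a rough sketch of how an imperfect name-to-gender step can distort a group-level share. Everything in it is an assumption for illustration: the function name `observed_share` is hypothetical, and the error rates (95% of women labelled correctly, 80% of men labelled correctly) are made up, not taken from any real tool or dataset.

```python
def observed_share(true_share, sensitivity, specificity):
    """Share of a group that an imperfect classifier labels 'female'.

    true_share:  actual fraction of women in the group
    sensitivity: fraction of women correctly labelled 'female' (assumed)
    specificity: fraction of men correctly labelled 'male' (assumed)
    """
    correctly_labelled_women = true_share * sensitivity
    men_mislabelled_as_women = (1 - true_share) * (1 - specificity)
    return correctly_labelled_women + men_mislabelled_as_women


# Hypothetical error rates, chosen only to show the effect.
for true_share in (0.1, 0.5, 0.9):
    estimate = observed_share(true_share, sensitivity=0.95, specificity=0.80)
    print(f"true share {true_share:.2f} -> estimated share {estimate:.3f}")
```

With these invented rates, a group that is 10% women would be reported as roughly 27.5% women, while a group that is 90% women would be reported as about 87.5%. The error does not average out; its size and direction depend on the group's makeup and on how the naming conventions in that group interact with the classifier, which is the case for ambiguous names like “Andrea”.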