Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple persons several cases #6879

Closed
EDsCODE opened this issue Apr 26, 2021 · 4 comments
Closed

Multiple persons several cases #6879

EDsCODE opened this issue Apr 26, 2021 · 4 comments
Labels

Comments

@EDsCODE
Copy link
Member

EDsCODE commented Apr 26, 2021

@paolodamico and I spent an afternoon sifting through person data of various teams to understand how teams were using posthog.identify, specifically when email was provided, and if the problem with seemingly multiple copies of a person was a bug in implementation or an issue with how users were calling the function. We discovered several cases:

  1. Flawless usage where persons are never duplicated against emails (example)
  2. Duplicated persons where it seemed as though a company was calling posthog.identify on an incrementing ID which was causing multiples of a person. Here's an example
  3. Duplicated persons across organization ids which is an expected side effect of signing up on different deployments explained here
  4. Duplicated persons where the distinct_ids look like anonymous ids we generate. Here's an example of a person with multiple rows and the distinct_ids associated with them. Notice that none of the distinct_ids seem to be user passed in

*Duplicated means there are several person rows that all seem like they should be the same person based on their properties (email mainly)

Will continue adding findings/fixes to this thread

@EDsCODE
Copy link
Member Author

EDsCODE commented Apr 29, 2021

@kpthatsme tagging you so you have a heads up of some reasons for the duplicates

@paolodamico
Copy link
Contributor

Worth referencing https://github.com/PostHog/internal/issues/243 which explains the main culprit of duplicated persons, and the path forward of how we'll address it.

@tiina303 tiina303 transferred this issue from PostHog/plugin-server Nov 3, 2021
@posthog-bot
Copy link
Contributor

This issue hasn't seen activity in two years! If you want to keep it open, post a comment or remove the stale label – otherwise this will be closed in two weeks.

@posthog-bot
Copy link
Contributor

This issue was closed due to lack of activity. Feel free to reopen if it's still relevant.

@posthog-bot posthog-bot closed this as not planned Won't fix, can't repro, duplicate, stale Nov 21, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants