What is an Identity Graph? | Everything You Need To Know
On an average, consumers in the US are using four devices each day. In all likelihood, they will not identify themselves (login) from most of the devices. Every activity across these devices, leaves behind crucial data about the customer which if stitched correctly can become extremely valuable with time.
‘Stitched’ is the key word here because your analytics tool is counting them as unique users. Check your Google Analytics for the ‘New vs Returning’ report. It will most likely show an unusual number of new visitors (>80%). This is because your customers are using multiple browsers across multiple devices and your web analytics is counting all visits from each browser as a unique person.
To add to the complexity, businesses are using multiple tools that store some form of customer information. Systems like CRM, Email, Ecommerce, Point-of-Sale and Social Media Tools store different customer attributes like contact details, transactions, messages, sessions and ad interactions. Businesses are left with isolated data sets and are paralyzed when it comes to connecting these data silos.
This can be a big challenge in maintaining up-to-date accurate customer profiles and behavior data. Businesses rely on accurate customer profiles to deliver relevant messaging and experience.
Hence the need for a technology that can resolve customer’s identities across devices and databases in one profile. This unified profile is also called as the ‘single view of customer’.
What is an Identify Graph?
An Identity Graph (ID Graph) is a database that stores all identifiers that correlate with individual customers. These identifiers could be anything from usernames to email, phone, cookies and even offline identifiers like loyalty card numbers.
The process of matching and linking customer records from disparate sources is called identity resolution. While it makes sense to link customer records lying in different silos, it is also important to link households and devices so you can fine tune targeting as required.
It is common for members of a family to use the same account. Take Amazon Prime, Netflix and Doordash for example. Many of these accounts are shared between family members. Businesses should treat user accounts as households. The name of the person on the account is just the most prominent member of the household.
For identifying users in a household, businesses can use device clustering; which is the process of grouping similar devices based on IP sharing history and browsing patterns.
Probabilistic vs Deterministic Matching
The matching algorithms used to stitch the profiles together are of two types; deterministic and probabilistic.
Deterministic matching is when you match the profiles with 100% certainty using identifiers like hashed email, phone or logged in username.
Probabilistic matching is not 100% certain and relies on identifiers like IP address, Device Type, Browsers and OS.
One of the techniques in probabilistic matching is to track the IP addresses of devices during different times of the day. For example if you detect three devices connecting from the same IP during day time and all three connecting from a different IP address in the evening, it’s probable that these devices belong to the same individual. The hypothesis here is that the individual is carrying the devices with him and connecting to different Wi-Fi networks throughout the day. The match is not 100% but you can achieve a higher accuracy by triangulating with different algorithms.
For example if you detect three devices connecting from the same IP during day time and all three connecting from a different IP address in the evening, it’s probable that these devices belong to the same individual.
Amit R G
Provide ID graph powered personalized customer service for your Ecommerce store
What is the underlying technology?
While there is no one technology solution to achieve Identity Resolution. The most essential component is the database responsible for maintaining billions of users and their relationships. A family of database specially designed for this application are called graph databases. These databases are extremely powerful and made specifically for the purpose of managing connections between billions of identities with millisecond latencies.
Not surprisingly, every major tech company uses them. Facebook uses graph database to determine connections like ‘friends of friends’ and generate timeline feeds. Amazon uses graph database to make product recommendations by seeing what others bought with the product you are looking at. Richpanel Customer Data Platform uses Artificial Intelligence to predict links between customers and devices and store these relations in a graph database.
Applications of Identity Graph
Identity graphs are primarily responsible for stitching customer identities and creating a ‘single view of customer’ which is the most accurate, up-to-date snapshot of customer attributes and behaviors. Once a business achieves this, they can use it for a number of applications including-
- Personalized customer service – Support agents use 6-10 tools to service customers. An ID Graph consolidates data from multiple sources so agents have full context about the customer. And can resolve customer queries quickly, at scale.
- Cross device attribution – Users interact with multiple channels during their lifecycle with a business. An ID Graph helps businesses identify the role of each channel in converting the user. This allows businesses to make better decisions when it comes to budgeting for marketing channels.
- Personalized in-app experiences – Noticed how Netflix and Amazon are always able to recommend the most relevant products (or shows)? They use ID graphs to track browsing history across devices and screens and recommend content that is most relevant for that screen and time. For example: If you run a video website and the user is accessing the content on mobile during commute, it makes more sense to customize the home page with more short videos.
- Deliver effective promotions -Track how customers interact with promotions to deliver context-aware messaging on the right screen at the right time. You can also set up workflows to set up multi stage messaging where users receive different messages depending on their stage in the conversion cycle.