CPG data analytics, part 1

4 minute read

Data is more important than ever for CPGs. By 2025 Nielsen and IMF estimate that nearly 60% of Americans will shop online for groceries. That activity can provide an unprecedented amount of information for manufacturers. 

Many of them are experiencing data overload. How can they overcome that? How can they make that data actionable? What use cases can be created that demonstrate the usefulness of CPG data analytics? This article provides answers.

This is the first of two articles on this subject. The second article looks at establishing a data strategy, data lakes, and a glimpse of the next generation of data analytics.

Table of Contents

CPG data analytics management

A number of forces already present in the CPG industry have been thrust forward during the pandemic: the meteoric rise of ecommerce and omnichannel, changing customer behavior, advertising personalization and growing supply chain complexity. 

All of these phenomena are generating data and many CPGs are overwhelmed by it, experiencing data overload.

This common issue is part of the digital transformation occurring at many large CPGs. It needs to be addressed by cross-functional teams as opposed to being regarded as an IT task. Silos prevent information and skill sharing which is necessary for CPGs to move forward. 

The agility that challenger brands possess has allowed them to gain market share during the pandemic, and harnessing data has contributed to their success. 

These smaller brands lack the infrastructure that creates silos. The proximity of teams, shared tasks and rapid, data-driven decision making has given them an edge that is eroding established CPG’s dominant position. 

Effective data management, analysis and usage requires input from–and benefits–several teams. The insights data analysis can provide are relevant to ecomm managers, KAMs, revenue management, supply chain teams, category managers and other teams. Data-driven conclusions are also critical to the C-suite. Business leaders need to be data literate so that they can recognize and express their data needs, and make it a priority. 

CPGs should designate people as owners of data sets and processes. This way responsibilities are clearly laid out and decisions about data sources and access rights can be made rapidly. Historically, CMI/ business insights/ shopper insights teams (as they’ve variously been called) have bought and handled data so they are well positioned to guide these initiatives.

The potential for data

Insights gleaned from data can be powerful enough to spearhead a company’s strategy. McKinsey reports that data-driven CPGs are outperforming their counterparts and can increase net sales value by 3 to 5 percent. 

The list of potential applications for data analysis is long and growing. Currently, it can provide new product development ideas, optimize pricing and promotions, personalize marketing, analyze sales performance and predict which causals will produce future sales. 

Data analysis can provide shopper behaviour insights and provide opportunities for unexpected and profitable partnerships. New applications for data analytics continue to emerge.

Reliable data

There is a lot of excitement around data now, and for good reason.

However, for data to be effective it must be accurate. 

If, for example, an ecomm manager approaches a retailer saying their OOS is 20%, that data needs to be correct. Eroding a retailer relationship is not an option.

The most accurate availability data is location-based data. This kind of data is collected at the SKU level from every online point of sale. With it, a brand can approach a retailer and show them precisely which estores have high OOS rates, and during what periods. This information can benefit both the retailer and the brand. 

Most solution providers use sample-based data for OOS rates. This kind of data is misleading and non-actionable. It’s misleading because the analysis it provides only reflects the reality in a small selection of stores. For example, our research shows that a sample of 10 online stores yields an accuracy rate of only 3.13% and a sample of 100 stores gives an OOS rate that is only accurate for 31.25% of total estores.

Further, sample-based data is non-actionable. Since this kind of data only refers to a small selection of estores (Walmart US, for instance, has over 4000), it’s not possible to locate exactly which stores have OOS rates that need to be addressed. With location-based analytics, it’s possible to filter stores by location, period and varying OOS rates.

Digital shelf and other types of data held by CPGs

CPGs typically receive a never-ending flow of data that comes from a range of sources. Supply chain teams, for example, can have thousands of data collection points originating from planning teams, factories, distribution centers, retailers, service providers and other partners. 

Common sources of data include:

Activity data, which indicates what field reps in the field are doing and achieving. It details the actions they take, at which stores, warehouses and outlets, and when or how often.  

Digital shelf data, such as price, availability, promotion status, visibility, competitor activity and more. CPGs also have similar data about the physical shelf collected in-store by reps with additional points like planogram compliance, store demographics, end caps, etc. 

Sales data, ideally by SKU, by store, by day, both current & historical. This data is very powerful when combined with digital shelf data.

Causal data. This data tracks the business levers (aka digital shelf KPIs) that play a role in the sale of a product. With these insights, it’s possible to pinpoint which digital shelf KPIs need to be optimized to increase sales. 

Promotion planning and related demand planning data. With this data, scheduling promotions in different regions and predicting the lift they will bring is compared with actual sales figures. This helps isolate contributing factors and maximize promotional performance. 

Supply chain data. This usually includes shipment and depletion data, backorders, warehouse lead times and much more. During the pandemic supply chain data became particularly important. As a result, the tech behind it has been developing quickly, and the data points have multiplied. For instance, real-time item tracking has become the industry standard. 

Consumer behavior data. Social media analytics have evolved quickly. Identifying emerging shifts and trends in consumer behavior is possible in near real-time, a powerful lever for brands. 

Eretail media data. A tectonic shift is occurring in online advertising. The approaching end of third party cookies and an increase in direct to consumer selling has created a lucrative opportunity for retailers to create their own media platforms through their online stores. The shopper data they collect is valuable to advertisers because it offers personalized targeting. This proliferation of retail media networks is changing the equation for CPGs. They no longer have to rely on Google, Facebook and Amazon for this kind of data.

Over time, and increasingly, CPGs acquire a lot of detailed data.

The big question is: how to make it actionable?

Read the second article in this series to find out. 

Privacy policy

Data collection - Use of cookies - Consent

DataImpact undertakes to ensure that the collection and processing of your data, carried out from the www.dataimpact.io site, comply with the Data Protection Act and the RGPD. This processing is necessary for the execution of our services and the internal functioning of our company. For any information on the protection of personal data, you can also consult the site of the Commission Informatique et Liberté www.cnil.fr.

Identity of the data owner:

Personal data are collected by : Société par actions simplifiée DataImpact whose registered office is at 39 Rue Lucien Sampaix, 75010 Paris, RCS PARIS 799 367 222 T: +33 (0)1 42 51 87 08

Purpose - use of your data:

DataImpact is likely to collect personal data about you for the purposes necessary for its activity, whether in terms of recruitment, responding to your requests for information, execution and monitoring of service contracts. Types of data collected: DataImpact only collects data that is strictly necessary for the purposes of its activity. The personal data collected can be the following:

-In the context of a request for information (name, first name, email, telephone, company name).

-As part of a recruitment process: (surname, first name, email, telephone, company name), information on the curriculum vitae (marital status, surname, first name, date and place of birth, nationality, professional background, academic background, hobbies)

-If necessary, connection data including your IP address may be collected for purely statistical purposes.

Origin of the data:

The personal data collected by DataImpact are those directly given by the person concerned when using the contact form or surfing on the site www.dataimpact.io.

Intended transfers of personal data to a non-EU Member State:

To date, DataImpact does not transfer, nor envisage any transfer of your personal data to a non-European Union member state.

Retention period of the categories of data processed:

Connection data are kept at the latest within one year after connection to the www.dataimpact.io website.
Data relating to applicants for a post are kept at the latest five years after the last contact, with a view to possible recruitment.

Data of prospects are kept no later than three years after the last contact.

Customer data are kept for the duration of the service contract.

Protection of your data:

DataImpact ensures that its employees and service providers, subcontractors or hosts, also respect the absolute confidentiality of the information provided to them.

We maintain in-house electronic and organizational security measures in relation to the collection, storage, and communication of data.

Your rights under the Data Protection Act:

DataImpact takes all appropriate measures in order to facilitate the exercise of the rights of its clients regarding their personal data (right of access, rectification, deletion, limitation of processing, portability, to define the fate of its data after death).

The information provided in connection with the exercise of these rights is provided in writing or electronically. On request, the information may be provided orally. All requests should be sent by post to 739 Rue Lucien Sampaix, 75010 Paris or to [email protected].

In accordance with the regulations in force, your request must be signed and accompanied by a photocopy of an identity document bearing your signature and specify the address to which the reply should be sent. A reply will then be sent to you as soon as possible and in any event within one month of receipt of the request.

Flows out of your data after your death:

The new article 40-1 of the French Data Protection Act allows individuals to give instructions regarding the storage, deletion and communication of their data after their death.

You can read the procedure relating to these directives by following the following link: “https://www.cnil.fr/fr/ce-que-change-la-loi-pour-une-republique-numerique-pour-la-protection-des-donneespersonnelles#mortnumerique”.


You are informed that, during your visits to the www.dataimpact.io website, a cookie may, if necessary, be automatically installed on your browser software. A cookie is a small file stored on your computer. As such, it is a block of data that does not allow users to be identified but is used to record information relating to their browsing on the site. Cookies are used, on the one hand, to facilitate your navigation on the site and, on the other hand, for statistical purposes. In order to better know the frequentation of the site, we (mainly) measure the number of pages viewed, visitors, visits, as well as the activity of visitors on our site and their frequency of return.

The parameters of the browser software make it possible to inform about the presence of cookies and possibly to refuse them in the manner described at the following address “http://www.cnil.fr/vos-libertes/vos-traces/les-cookies/”.

You have the right to access, withdraw and modify personal data communicated through cookies under the conditions indicated above.

Terms of Service

Article 6 III of the Law of 22 June 2004

Société par action simplifiée DataImpact
39 Rue Lucien Sampaix, 75010 Paris
T: +33 (0)1 42 51 87 08
M: [email protected]
RCS PARIS 799 367 222

Director of publication: Yacine TERKI

Hosting : O2 SWITCH 222 Boulevard Gustave Flaubert 63000 Clermont-Ferrand

Terms and conditions of use:

The information contained and consultable on this site is provided for information purposes by DataImpact. They can be modified at any time without notice. Under no circumstances does it constitute advice or a service of any kind whatsoever. You assume full responsibility for the use of this site or the information it contains.

DataImpact cannot be held responsible for damages related to the consultation or use of the website by the user. Hypertext links may refer to third party sites over which DataImpact has no control.

DataImpact declines all responsibility for the content of these sites. The use of this service is reserved for strictly personal use. Any reproduction or representation, of all or part of the information, brochures or logos contained on the site, on any medium whatsoever, is prohibited. Failure to comply with this prohibition constitutes an infringement that may result in civil and criminal liability of the counterfeiter.