How a leading consumer insight brand uses Dataproc to hyper-personalise faster

At RVU, we have a clear and vital mission: empower people, transform industries.

For our market-leading home management and switching brands — Confused.com, Uswitch, Tempcover, Money.co.uk, and Mojo Mortgages — transparency and accurate information are everything. Today’s consumer expects more than a simple comparison table; they want personalized recommendations tailored to their unique circumstances.

Delivering on that promise — building a true personalization engine that powers all our brands — requires a data foundation capable of processing massive, complex datasets for sophisticated ML models. Today, our platform powers hundreds of automated personalization campaigns, optimized with billions of data points from across all our brands. We tackled this challenge using the power of Google Cloud and its two solutions for Apache Spark: Dataproc and Google Cloud Serverless for Apache Spark. Together, we’re making our mission a reality.

The high-speed engine for feature engineering

Our relationship with Google Cloud isn’t new. In fact, we’ve used BigQuery as our unified data platform for over a decade. Coming from a performance marketing background, we’ve always dealt with a lot of data, but we recognized early on that we’re not a digital infrastructure company. Instead, our focus must always be on where the value is. Managed solutions like BigQuery that eliminate infrastructure and capacity headaches were a natural fit from the start.

The key challenge was stitching together a meaningful and coherent picture of customer behavior across our brands — turning countless fragmented interactions into something that genuinely reflects how a user behaves, clicks, and makes decisions. Instead of relying on isolated events and aggregate views, we’ve had to build a platform capable of connecting these signals into a narrative that works for our machine learning models.

Using Dataproc to support this was a gamechanger. The biggest impact has been its role as our core high-speed Spark processing engine, primarily for feature engineering for our ML model development. Feature engineering, which is the crucial process of shaping all that raw customer data for our data science models, is a real value-driver for us. It’s where we have a marked competitive edge.

The result has been a significant leap in our innovation velocity. With Serverless for Apache Spark, we now have the ability to shape our customer data for feature engineering in just a matter of days. Previously, this would have taken weeks. We’ve also dramatically reduced our time-to-market, which also used to take weeks. Now, a new contractor can join the team and deliver a model, including exploratory data analysis and all feature engineering, in only a week and a half. That’s incredibly fast.

Delivering personalized experiences

By improving our speed of innovation, we’re better positioned to deliver a personalized user experience to our customers and partners.

Our hyper-personalization journey accelerated once we moved to Spark. We can now run heavyweight data processing jobs that crunch vast amounts of behavioral and contextual data, allowing us to build models that generate genuinely meaningful predictions.

These models help us understand not just what to say to a customer, but when and how to say it — selecting the right moment and the right channel to deliver personalized insight that genuinely resonates.

Building a future vision

Google’s Data Cloud directly aligns with our culture of prioritizing value, and its impact on our business is profound. I call it the network effect, where everything seamlessly connects within the same ecosystem: Our data resides in BigQuery, our ability to validate, enrich, and transform that data is tied to Dataproc and Serverless for Apache Spark, and our capacity to deploy the ML models spans the network. It’s all co-located and integrated, powering the real-time accuracy of our consumer brands and giving us a competitive advantage.

For our engineers, the big win is the lack of infrastructure they have to deal with. They can press a button that processes all the data in 10 minutes, rather than having to set up a network of clusters and servers and make them talk to each other. It’s incredibly efficient and frees up time for more valuable work like building and iterating data products.

Dataproc has upped our speed, scale, and agility. It also gives us the tools to innovate with AI as we build the future of hyper-personalization. Today, we’re proud to say RVU’s cutting-edge tech and data are helping millions of UK consumers make smarter, more informed decisions, and truly transforming industries.

Inspired by RVU’s success? Whether you need persistent clusters with Dataproc or the agility of Serverless Spark, Google Cloud has a managed solution to help you focus on value, not infrastructure. Discover the right Spark on Google Cloud for your use case.

How a leading consumer insight brand uses Dataproc to hyper-personalise faster

Looker self-service Explores | Google Cloud Blog

Leave a Reply Cancel reply

You might also like

How a leading consumer insight brand uses Dataproc to hyper-personalise faster

Looker self-service Explores | Google Cloud Blog

Looker Embedded adds conversational analytics

Ironwood TPUs deliver 3.7x carbon efficiency gains

Best WiFi Router For A Large Home | 2024

How to Set Up a Wireless Router as an Access Point

Stay Connected

Welcome Back!

Create New Account!

Retrieve your password