HPC and big data convergence in fraud detection

The convergence of two existing disciplines can be an explosively creative force. A great example in the tech world is the convergence of HPC with big data and machine learning. Though this convergence is in many ways still in its early stages, it is already starting to deliver concrete, real-world benefits in fraud detection, helping financial firms save hundreds of millions of dollars.

Unsurprisingly, PayPal is one of the companies at the forefront of this convergence. As an online transaction processor that was born on the Internet, PayPal has grown up exposed to virtually every cyber security threat and fraud imaginable. Because of this, the company has aggressively pursued a security strategy that has used both HPC and big data technologies since as early as 2001. Although PayPal keeps the details of its fraud protection systems secret, it has been very open about leveraging the flexibility of the open-source H2O machine learning framework in conjunction with its big data infrastructure, which gathers more than 20 terabytes of log data every day.

To gain insight from this massive Hadoop dataset, PayPal, which handles over 13 million online monetary transactions per day, combines three types of machine learning (linear, nonlinear, and deep learning) to help identify and stop fraudsters. The company estimates that in just the first few years of deploying its fraud detection systems, it has saved over $700 million in fraudulent transactions that it otherwise would not have noticed.

Though PayPal may have been one of the first to recognise the value of converging HPC and big data technologies, today virtually all the major financial services firms are seeking novel ways to combine these technologies in order to protect themselves. Another high-profile example is MasterCard, which has a staggering 2.2 billion cards in use in 330 countries, and handles roughly 160 million transactions per hour, or 52 billion transactions a year. MasterCard, much like PayPal, employs a hybrid machine learning approach that uses supervised and unsupervised learning, in addition to traditional big data technologies like Hadoop and Spark, to examine the location, spending habits, and travel patterns of the customer before each purchase is made. According to Nick Curcuru, Vice President of Global Big Data Consulting at MasterCard, the company’s infrastructure applies 1.9 million distinct rules to examine each transaction, and processes each of these transactions in just milliseconds.
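To make the idea of pairing rules with a learned notion of "normal" spending a little more concrete, here is a minimal Python sketch. It is purely illustrative and is not how MasterCard or PayPal actually work: the `Transaction` and `CustomerProfile` types, the single `unusual_country` rule, and the z-score threshold are all assumptions invented for this example, and a simple statistical outlier check stands in for a real unsupervised model.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import Callable, Dict, List


@dataclass
class Transaction:
    amount: float
    country: str


@dataclass
class CustomerProfile:
    home_country: str
    past_amounts: List[float]  # assumed non-empty spending history


# A rule is any function that returns True when a transaction looks suspicious.
Rule = Callable[[Transaction, CustomerProfile], bool]


def unusual_country(tx: Transaction, profile: CustomerProfile) -> bool:
    """Hypothetical rule: the purchase is made outside the customer's home country."""
    return tx.country != profile.home_country


def amount_zscore(tx: Transaction, profile: CustomerProfile) -> float:
    """How many standard deviations the amount lies from the customer's usual spend."""
    mu = mean(profile.past_amounts)
    sigma = pstdev(profile.past_amounts)
    if sigma == 0:
        # No variation in the history: any different amount counts as extreme.
        return 0.0 if tx.amount == mu else float("inf")
    return abs(tx.amount - mu) / sigma


def assess(tx: Transaction, profile: CustomerProfile,
           rules: List[Rule], z_threshold: float = 3.0) -> Dict[str, object]:
    """Combine explicit rules with the outlier score (threshold is an assumption)."""
    fired = [rule.__name__ for rule in rules if rule(tx, profile)]
    z = amount_zscore(tx, profile)
    return {"fired_rules": fired, "zscore": z,
            "flagged": bool(fired) or z > z_threshold}


if __name__ == "__main__":
    profile = CustomerProfile("GB", [20.0, 25.0, 30.0, 22.0, 28.0])
    print(assess(Transaction(26.0, "GB"), profile, [unusual_country]))
    print(assess(Transaction(900.0, "BR"), profile, [unusual_country]))
```

A production system would evaluate vastly more rules and far richer models within a few milliseconds per transaction, which is where the HPC side of the convergence comes in.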

Other major institutions have developed their own mix of technologies to protect themselves against fraud. Citibank, for example, has recently invested in machine learning companies such as Feedzai, Cylance and Ayasdi to bolster its fraud detection capability. The company also recently announced that it would open a branch of its Global Innovation Lab at a London WeWork, specialising in the development of big data and high-performance computing technologies.

The quest for better fraud protection is a never-ending one, however, as credit card fraud continues to increase in severity. Credit card fraud in Europe caused €1.8 billion in damages in 2016 (with the UK and France accounting for 73% of that), while in the US it has been a persistent problem. Even debit cards, which have so far been relatively safe, are now starting to see a rise in fraud.

The problems of controlling fraud are compounded by demands from consumers, who want faster, easier payments and greater flexibility. This pressure was almost certainly a factor when the Payment Card Industry (PCI) Security Standards Council, the regulatory body that oversees transactions from credit and debit cards, decided to start allowing PINs to be entered on mobile phones. The decision may benefit the user experience, but it also introduces a new level of vulnerability into the transaction process, and has therefore elicited some scepticism.

Delivering an improved user experience while maintaining maximum security will require better fraud detection systems. This points to a need for stronger, more tightly converged systems that can draw deeper insight from customer and transaction data. Realising a better union of HPC, big data, and machine learning will require closer harmony between once-heterogeneous computational infrastructures, and perhaps also a shift in perception. Challenges aside, major players are already making serious commitments to this convergence, and envision a not-too-distant future in which the three workloads can be seamlessly processed on one system.

Verne Global, which provides its clients in these fields with low-cost, fully sustainable power resources, is in an ideal position to help facilitate this transformation, a role we’re eager to fulfil.

Written by Spencer Lamb

Spencer is Verne Global's Director of Research and heads up our high-performance computing work with European research and scientific organisations. He is also a member of the European Technology Platform for High Performance Computing (ETP4HPC).
