Coming to the aid of such companies is California based Argyle Data, which uses state-of-the-art machine learning on a Hadoop stack to deliver fraud analytics applications, that can ingest data and analyse it in real time, reducing the window for risk or fraud from hours (or days) to minutes. Under the leadership of Tom Ryan, President and CEO, Argyle Data has built a team of experts on on applying machine learning on Hadoop at massive scale complemented by industry fraud experts. Prior to Argyle, Tom was president and CEO of Alpine Data Labs, a leader in predictive analytics for big data.
Working with some of the biggest players in mobile communications, financial services and ecommerce, Argyle’s products, use technology pioneered at Facebook and the NSA to deliver machine learning at petabyte scale to identify fraud and anomalous beheviour. Tom Ryan uses the phrase “When minutes means millions” to drive the prioritization of the company’s fraud analytics strategy.
ArgyleDB is an integrated solution that brings together all of the critical components to drive fraud analytics from the 1990’s to the era of machine learning and big data. They identified packet ingestion, schema-less indexing with time-series support, anamoly detection with machine learning and querying at petabyte scale as key requirements. ArgyleDB Ingest is able to non-invasively tap into the network and perform Deep Packet Inspection (DPI) and stream live network packets, or log files into a “Data Lake”. The data is stored in a key value database deveoped at the NSA that indexes the data in a schema-less way with time series support. ArgyleDB Machine Learning is able to access streaming data and perform anomaly detection using multiple algorithms that learn online against the full data set (both streaming and historical). This is critical to identifyiing fraud patterns where comparisons to a previous year/season is critical. ArgyleDB Query is a native Hadoop real-time SQL database that supports schema on read and queries the data lake through ANSI SQL. Complex joins, aggregation and windowing functions, that are critical to identifing fraud, are supported across 1000’s of nodes.
Roadmap for Argyle Data
Looking towards the future, Argyle Data’s mission is to re-write the book on fraud analytics and create a data lake on which to build a suite of related applications. The company also intends to take their unique ability to counter fraud, to additional customers in U.S.A and Europe, and plans to expand in Asia as well.