How Big Data Is Empowering AI and Machine Learning at Scale

Big data is powerful on its own. So is artificial intelligence. What happens when the two are merged?

Big data is moving to a new stage of maturity — one that promises even greater business impact and industry disruption over the course of the coming decade. As big data initiatives mature, organizations are now combining the agility of big data processes with the scale of artificial intelligence (AI) capabilities to accelerate the delivery of business value.

The Convergence of Big Data and AI

The convergence of big data with AI has emerged as the single most important development shaping how firms drive business value from their data and analytics capabilities. Greater volumes and more sources of data are, for the first time, enabling AI and machine learning capabilities that lay dormant for decades because of limited data availability, small sample sizes, and an inability to analyze massive amounts of data in milliseconds. Digital capabilities have moved data from batch processing to real-time, online, always-available access.

Although many AI technologies have existed for several decades, only now can they take advantage of datasets large enough to provide meaningful learning and results. Ready, agile access to large volumes of data is driving a rapid evolution in AI and machine learning applications. Whereas statisticians and early data scientists were often limited to working with “sample” sets of data, big data has enabled data scientists to access and work with massive sets of data without restriction. Rather than relying on representative data samples, data scientists can now rely on the data itself, in all of its granularity, nuance, and detail.

This is why many organizations have moved from a hypothesis-based approach to a “data first” approach. Organizations can now load all of the data and let the data itself point the direction and tell the story. Unnecessary or redundant data can be culled, and more indicative and predictive data can be analyzed using “analytical sandboxes” or big data “centers of excellence,” which take advantage of the flexibility and agility of big data management approaches. Apostles of big data have often referred to their approach as “load and go.”

Big data enables an environment that encourages data discovery through iteration. As a result, businesses can move faster, experiment more, and learn quickly. To put it differently, big data enables organizations to fail fast and learn faster.
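The difference between working from a sample and working from the full data set can be illustrated with a small sketch. Everything in it is hypothetical: the records are synthetic, the rare event is planted once in every 10,000 records, and the function names are made up for illustration. It only shows that a 1,000-record sample reports rates in steps of 0.001, so it cannot resolve a signal that occurs in 0.01 percent of records, while the full data can.

```python
import random


def build_population(size=1_000_000, every=10_000):
    """Hypothetical data: exactly one rare event in every `every` records."""
    return [1 if i % every == 0 else 0 for i in range(size)]


def rare_event_rate(records):
    """Fraction of records flagged as the rare event."""
    return sum(records) / len(records)


population = build_population()

# Traditional approach: draw a small random sample (seeded for repeatability).
rng = random.Random(42)
sample = rng.sample(population, 1_000)

full_rate = rare_event_rate(population)
sample_rate = rare_event_rate(sample)

print(f"rate from full data:    {full_rate:.6f}")
print(f"rate from 1,000 sample: {sample_rate:.6f}")
```

The sample estimate is either zero or at least ten times the true rate, which is the kind of granularity loss that a “data first” approach is meant to avoid.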

Big Data and AI at MetLife

Pete Johnson is one of the most experienced executives working in the field of big data and AI within the industry today. Having worked in the field of artificial intelligence for a generation, dating back to his academic career at Yale University, Johnson now leads big data and AI initiatives as a fellow at MetLife. Johnson previously held positions as senior vice president for Strategic Technology with Mellon Bank and served as the executive vice president and chief technology officer of Cognitive Systems Inc. (CSI), an early artificial intelligence company specializing in natural language processing, expert systems, case-based reasoning, and data mining. CSI was founded by several members of the Yale University faculty in 1981, the year Johnson completed his MS in computer science.

Johnson, whom I’ve known for over a decade, is a regular participant in a series of executive thought-leadership breakfasts that I host for senior industry executives to share perspectives on big data, AI, and machine learning with their peers. Participants in the most recent executive breakfasts have included chief data officers, chief analytics officers, chief digital officers, chief technology officers, and heads of big data for firms including AIG, American Express, BlackRock, Charles Schwab, Citigroup, General Electric (GE), MetLife, TD Ameritrade, Visa, and Wells Fargo, among others. As a long-suffering expert in the field of artificial intelligence, Johnson observes three critical ways in which big data is now empowering AI:

  1. Big data technology: Through “commodity parallelism,” we can now process huge quantities of data that previously required extremely expensive hardware and software.
  2. Availability of large data sets: ICR (intelligent character recognition), transcription, voice and image files, weather data, and logistics data are now available in ways that were never possible in the past; even old “paper sourced” data is coming online.
  3. Machine learning at scale: “Scaled-up” algorithms such as recurrent neural networks and deep learning are powering the breakthrough of AI.
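The idea behind “commodity parallelism” in the first point can be sketched as a split, apply, and combine pattern: divide the data into chunks, let many inexpensive workers each summarize one chunk independently, then merge the small partial results. The sketch below is an illustration under stated assumptions, not a description of any system used at MetLife. The function names are hypothetical, and a local thread pool stands in for a cluster of commodity machines, so it shows the structure of the computation rather than a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(records, n_chunks):
    """Split records into roughly equal contiguous chunks."""
    if not records:
        raise ValueError("records must not be empty")
    size = -(-len(records) // n_chunks)  # ceiling division
    return [records[i:i + size] for i in range(0, len(records), size)]


def summarize_chunk(chunk):
    """Work one hypothetical node does on its own: count and sum of a chunk."""
    return len(chunk), sum(chunk)


def parallel_mean(records, n_workers=4):
    """Compute the mean by combining per-chunk partial results."""
    chunks = split_into_chunks(records, n_workers)
    # The thread pool is a stand-in for a set of commodity machines.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(summarize_chunk, chunks))
    total_count = sum(count for count, _ in partials)
    total_sum = sum(subtotal for _, subtotal in partials)
    return total_sum / total_count


readings = list(range(1, 100_001))  # synthetic data for illustration
print(parallel_mean(readings))
```

Because each chunk is summarized without reference to any other chunk, adding more workers or machines lets the same code handle more data, which is the property that made large-scale processing affordable on ordinary hardware.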

Read the entire article on MIT Sloan Management Review
