To dramatically reduce time to market on analytical projects, a credit reporting agency planned to build an analytical sandbox in Hadoop. This meant moving five years’ worth of data on more than 400 million consumers into Hadoop and analyzing it there.
However, the agency’s data is highly regulated under state and federal mandates, as well as contractual obligations and internal corporate security policies. To implement the sandbox, the agency needed verified data security in Hadoop, both to protect personally identifiable information (PII) and to meet PCI compliance requirements.
The agency needed a solution that could balance high-performance analytics with effective security in Hadoop.