One of the biggest inhibitors to running Hadoop in production is the general lack of governance tools that IT organizations can use to manage access permissions for the data that resides there.
Matt Schumpert, director of product management at Datameer, says that because its software runs in memory as a Hadoop application, responsibility for data governance within Hadoop naturally falls to Datameer.
To solve that problem, the company is providing access controls within Datameer and publishing open bi-directional REST application programming interfaces (APIs) through which governance metadata can be shared with either the open source Apache Sentry authorization software running on Hadoop or any third-party data management software that can consume a REST API.
The immediate goal, says Schumpert, is to provide a mechanism for data lineage that shows where analytics results originated and how data and analytics were used, modified or published via the Datameer user interface or the REST API.
Regardless of where the locus of data governance ends up in the era of Big Data, IT organizations are going to need better governance tools. With more data than ever being collected in platforms such as Hadoop, the amount of sensitive data being actively managed by IT organizations will dramatically increase.