POSTER: ACCESS CONTROL MODEL FOR THE HADOOP ECOSYSTEM

J. Pavithra

Abstract: Apache Hadoop is an important framework for fault-tolerant, distributed storage and processing of Big Data. The Hadoop core platform, together with other open-source tools such as Apache Hive, Storm, and HBase, offers an ecosystem that enables users to fully harness the potential of Big Data. Apache Ranger and Apache Sentry provide access control capabilities to several ecosystem components by offering centralized policy administration and enforcement through plug-ins. In this work we discuss the access control model for the Hadoop ecosystem (referred to as HeAC) used by Apache Ranger (release 0.6) and Sentry (release 1.7.0), along with the native authorization capabilities of Hadoop 2.x. This multi-layer model provides several access enforcement points to keep unauthorized users from accessing cluster resources. We further outline some preliminary approaches to extend the HeAC model in ways consistent with widely accepted access control models.

Keywords: Access Control; Hadoop Ecosystem; Big Data; Data Lake; Role Based; Attributes; Groups Hierarchy; Object Tags.

International Journal of Computer Science and Information Technology Research

ISSN 2348-1196 (print), ISSN 2348-120X (online)

Research Publish Journals

Vol. 6, Issue 3, July 2018 - September 2018
