O`Reilly 的 《 Architecting Data Lakes Data Management Architectures for Advanced Business Use Cases 》,全面介绍了数据湖的构架、工作机理、构建与管理、规划、价值、展望等诸多方面的内容。
其目录如下:
1. Overview
What Is a Data Lake?
Data Management and Governance in the Data Lake
How to Deploy a Data Lake Management Platform
2. How Data Lakes Work
Four Basic Functions of a Data Lake
Management and Monitoring
3. Challenges and Complications
Challenges of Building a Data Lake
Challenges of Managing the Data Lake
Deriving Value from the Data Lake
4. Curating the Data Lake
Data Governance
Data Acquisition
Data Organization
Capturing Metadata
Data Preparation
Data Provisioning
Benefits of an Automated Approach
5. Deriving Value from the Data Lake
Self-Service
Controlling and Allowing Access
Using a Bottom-Up Approach to Data Governance to Rank Data Sets
Data Lakes in Different Industries
6. Looking Ahead
Ground-to-Cloud Deployment Options
Looking Beyond Hadoop: Logical Data Lakes
Federated Queries
Data Discovery Portals
In Conclusion
A Checklist for Success
完整内容,可以在此下载:http://www.oreilly.com/data/free/architecting-data-lakes.csp?intcmp=il-data-free-lp-lgen_free_reports_page
也可以随时Email:Hiweb@Outlook.com 沟通探讨。
免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:is@yisu.com进行举报,并提供相关证据,一经查实,将立刻删除涉嫌侵权内容。