Big Data Infra Admin 101: Ticket based learning, Big Data Admin Solving Tickets for Real life Job Like scenarios.
Course Description
Big Data Infra Admin 101: Ticket based learning
Why this course:
- Big Data Admin Solving Tickets for Real life Job Like scenarios
- Most courses teach topics but do not show you tickets to solve in real life
- This course also teachs you about python, shell code used for admin big data
- Making you ready for the tickets you can solve immedeitaly after this course
- Reading logs while compatification on hadoop
- Unstuck yourself in permissions and have limited access
- Reading old wiki pages and researching past progress by other users
- Readings logs and downloading logs from various places
The course will help you solve tickets on :
- Small files Issue: Hive small files & Non Hive Small files
- Remedial steps for small files: Understanding how to repartition, scripts for compactification based on python, shell or workflows
- Old files more than 6 month old
- Removing old Runs generally more than 6 months on different Lanes
- Removing files using workflow, shell scripts and shell commands
- Archiving files: remedy for old files
- Using wikipedia to read notes and save notes
- Official Release documents: Latex and MS Share points
- Different front end tools like dashboards, HUE, etc
- Making notes of all progress on wikipedia
- Python and Tableau based dashboards
Free