Apache AsterixDB for Beginners (Big Data Management System)
Course Description
AsterixDB is a BDMS (Big Data Management System) with a rich feature set that sets it apart from other Big Data platforms. Its feature set makes it well-suited to modern needs such as web data warehousing and social data storage and analysis.
We will learn
- Introduction to AsterixDB
- Performance of Apache AsterixDB
- Key Features
- Example of Product using AsterixDB
- Installation of Apache AsterixDB
- Asterix Data Model (Dataverse, Datatype, Dataset)
- Creating Dataset and Index
- Loading Data into AsterixDB
- Arithmetic Operators
- Logical Operators
- Numeric Functions
- String Functions
- Tokenizing Functions
- Aggregate Functions (Array Functions)
- Comparison Functions
This course is designed for beginners in Apache AsterixDB, including those looking for a new job as a data engineer, big data engineer, or software developer.
You’ll walk away from this course with a solid understanding of Apache AsterixDB and its associated distributed systems, and you will be able to apply it to real-world problems. A completion certificate is also waiting for you at the end!
AsterixDB has:
- Data model: A semistructured NoSQL-style data model (ADM) resulting from extending JSON with object database ideas
- Query languages: Two expressive and declarative query languages (SQL++ and AQL) that support a broad range of queries and analysis over semistructured data
- Scalability: A parallel runtime query execution engine, Apache Hyracks, that has been scale-tested on up to 1000+ cores and 500+ disks
- Native storage: Partitioned LSM-based data storage and indexing to support efficient ingestion and management of semistructured data
- External storage: Support for query access to externally stored data (e.g., data in HDFS) as well as to data stored natively by AsterixDB
- Data types: A rich set of primitive data types, including spatial and temporal data in addition to integer, floating point, and textual data
- Indexing: Secondary indexing options that include B+ trees, R trees, and inverted keyword (exact and fuzzy) index types
- Transactions: Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store
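
To make the data model, query language, and indexing features above more concrete, here is a minimal sketch in Python that prepares SQL++ statements for a local AsterixDB instance. The dataverse, type, dataset, and index names (CourseDemo, UserType, Users, usersSinceIdx) are hypothetical and chosen only for illustration, and the endpoint URL is an assumption about a default single-machine installation. The sketch only builds the HTTP request and does not send it.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: query service endpoint of a default local AsterixDB install.
# Adjust host and port for your own cluster.
QUERY_SERVICE_URL = "http://localhost:19002/query/service"

# Hypothetical names (CourseDemo, UserType, Users, usersSinceIdx), for illustration only.
# A dataverse groups types and datasets; a datatype describes a record's fields;
# a dataset stores records of that type under a primary key.
SETUP_STATEMENTS = """
DROP DATAVERSE CourseDemo IF EXISTS;
CREATE DATAVERSE CourseDemo;
USE CourseDemo;

CREATE TYPE UserType AS {
    id: int,
    name: string,
    userSince: datetime
};

CREATE DATASET Users(UserType) PRIMARY KEY id;
CREATE INDEX usersSinceIdx ON Users(userSince) TYPE BTREE;
"""

# A declarative SQL++ query over the dataset, filtering on a temporal field.
SAMPLE_QUERY = """
USE CourseDemo;
SELECT u.name, u.userSince
FROM Users u
WHERE u.userSince >= datetime("2012-01-01T00:00:00")
ORDER BY u.name;
"""


def build_request(statement: str, url: str = QUERY_SERVICE_URL) -> Request:
    """Build (but do not send) a form-encoded POST carrying a SQL++ statement."""
    body = urlencode({"statement": statement}).encode("utf-8")
    return Request(url, data=body, method="POST")


if __name__ == "__main__":
    request = build_request(SAMPLE_QUERY)
    print(request.get_method(), request.full_url)
```

With a running instance, the request could be passed to `urllib.request.urlopen` and the response parsed as JSON. Because ADM types are open by default, records stored in Users may carry extra fields beyond the three declared in UserType.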