NSE(Native Storage Extension) Data Tiering Options
What is NSE ?
HANA SPS 04 version has introduced the NSE. NSE is used to store the warm data. HANA was used to store the hot data in memory but as data growth occurred in some of the organization the need for another store came into picture and SAP came out with a solution to introduce the another store called as warm store which in turn called as NSE.
Please refer the below block diagram by SAP :
Customers implementing the SAP NSE solution:
Customers who are planning to think about implementing the NSE should closely monitor the data growth. They need to compare the total memory size vs the used size.
In our system we found the database growth of 61.91% per year so we moved towards implementing the NSE.
In-Memory/Hybrid/On-Disk – NSE has the feature to store the data in disk-based column store and Hana has the feature to store the data in in-Memory column store.So it has a hybrid column store approach.
NSE integration is based on the Hana Persistence layer in close connection to Page Access and Resource Manager.
Buffer Cache ( BC ) is required for performance access to pages on disk.The buffer cache should avoid redundant I/O operations by keeping pages which are access frequently in memory rather than reading them from the disk repeatedly.The Buffer Cache uses LRU (Last Recently Used) and HBL (Hot Buffer List) strategies and reuses the pages from the internal pools instead of allocating/deallocating pages via HANA memory management.
- • HANA as an in-memory database executes queries with allocating transient data and interim results in memory. Queries do not page interim results or parts of interim results from memory to disk.
• HANA keeps NSE data in memory in a buffer cache. Low hit rates in the buffer cache can cause insufficient query performance due to high number of disk reads.
• SAP provides general guidelines about the buffer cache sizing in the SAP HANA Administration Guide for SAP HANA Platform. Deviations from the guidelines require application based sizing or proof of concepts with workload simulations.
• Users can store warm data in NSE instead of on Dynamic Tiering. In contrast to Dynamic Tiering, the query execution in the HANA service, storing the NSE data, creates transient data and interim result in memory only. Thus, the memory requirement for a comparable workload can be higher with NSE. A solution to migrate data from Dynamic Tiering to NSE in on the road map for SAP HANA.
• SAP HANA NSE is for scale-up systems. For scale-out systems, SAP HANA does not check if users create tables with page-loadable columns or convert tables to page-loadable.
• SAP HANA NSE supports partition load units for heterogeneous partitions but does not support partition load unit for non-heterogeneous partitions.
• Specifying partition-level load unit is supported for the following partitioning schemes:
For all other partitioning schemes used in SAP NSE tables, load units can be specified only on column, table, and index.
Hana Data Tiering Options are as follows:
- Hot Store
– Persistent Memory
- Warm Store
– Native Storage Extension , Extension Node, Dynamic Tiering
- Cold Store
– Spark Controller
NSE Value Proposition and Use Case:
- Value proposition:
- Increase HANA data capacity at low TCO
- Deeply integrated warm data tier, with full HANA functionality
- Will support all HANA data types and data models
- Simple system landscape
- Scalable with good performance
- Supported for both HANA on-premise and HANA-as-a-Service (HaaS)
- Available for any HANA application
- Complements, without replacing, other warm data tiering solutions (extension nodes, dynamic tiering)
- Use cases:
- Any customer built or SAP built HANA application that is challenged by growing data volumes
- S/4HANA data aging (NSE is an evolution of “paged attributes”)
- BW team currently uses extension nodes, but they communicated in TECH ED 2019 that NSE is certified for BW/4HANA
Specifying data as “page loadable”
- Data may be specified as “page loadable” at table level, partition level, and column level
- Data may be converted between “page loadable” and “column loadable”
- NSE supports range, range-range, and hash partitioned tables
- For hash partitioning the entire table or column must be page loadable or column loadable
NSE Technical Overview:
- The HANA column store and row store each have a buffer cache.
- Column loadable data is fully loaded into memory from disk.
- Page loadable data is loaded from disk into the buffer cache, page by page as needed.
- Converting column/row loadable data to page loadable format moves the data into the buffer cache.
- When buffer cache is full, it will eject pages intelligently based on user access patterns.
- Warm and hot data are written together from main store to disk during normal save point operations. The write-optimized store is not paged
- Configure buffer cache size (on-premise only; HaaS will configure this for the user)
- Configure tables, columns, and partitions as “page loadable”
- Monitor buffer cache usage and capacity
- Report on resident memory status for page loadable data
- Includes rule-based “recommendation engine” to monitor user data access patterns.
- Based on statistics, the engine will advise user on which tables, columns, or partitions would benefit from being converted to “page loadable”
Data Lifecycle Manager (DLM):
- DLM tool will allow user to convert tables, columns, and table partitions between “column loadable” and “page loadable”
- Visualized query plan will display when warm data is accessed from NSE in order to satisfy the query
- HANA system must be scale up (first release)
- Determine volume of warm data to add to the HANA database
- May add as much warm storage as desired – up to 1:4 ratio of HANA hot data in memory to warm data on disk
- NSE disk store should be no larger than 10TB for first release of NSE
- Divide volume of warm data by 8 – this is size of memory buffer cache required to manage warm data on disk
- Either add more HANA memory for buffer cache, or use some of existing HANA memory for buffer cache (will reduce hot data volume)
- Work area should be same size as hot data in memory (equivalent to HANA with no NSE)
SAP HANA Extension Node – Whats New in SPS04
- HANA node in the scale-out landscape is reserved for warm-data storage and processing
- Supports all HANA operations and data management features
- Allows larger data footprint of up to 200% of the node DRAM size
- HANA persistent memory is supported
- Benefits from new partitioning and scale-out features in SPS04:
–range-hash partitioning scheme
–“pinning” tables on fixed HANA nodes
Warm Store Options – Getting Started
Which Data Tier Should I Use ?
I tried to explain the best way of handling the NSE and also show case how to Select the NSE for each customers.In my next article I will try to articulate all the technical changes required for the NSE and also introduce the data aging concepts which is key to NSE.
SAP HANA offers with NSE another warm data tiering option which is completely integrated and can be seamlessly used out of the box since SP40.
While adapting existing column store persistence building blocks to handle the new advanced page attribute behavior compared to the already existing PA every other component in Hana is able to work as designed.
As NSE will serve under warm data high query performance KPIs will logically not be reached. Since loading data from Disk to Memory will take its time.
Data will be held in the so-called Buffer Cache with an initial value of 10 % of the HANA memory and NSE Advisor will help to find objects which should be converted to page loadable.