on 2021 Sep 22, 10:51 PM
We are currently on SAP HANA 2.0 SP05 and planning to implement the HANA machine learning algorithms (specifically PAL).
What guidelines are available for sizing the capacity for the SAP Hana PAL implementation?
Hi Shankar,
SAP HANA Cloud requires an additional vCPU (3 instead of 2) for running the scriptserver process (as does the document store). The Application Function Libraries (AFL), of which the Predictive Analysis Library (PAL) is part, are a database-side implementation of machine learning and statistical analysis. PAL does not require GPUs; the usual sizing recommendations apply.
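For an on-premise tenant database, the scriptserver mentioned above is not running by default. As a hedged sketch (the tenant name `MYDB` is illustrative), it can be started from the system database like this:

```sql
-- Run in the SYSTEMDB: start the scriptserver for tenant MYDB
-- (illustrative name), which hosts AFL/PAL execution in its own process.
ALTER DATABASE MYDB ADD 'scriptserver';
```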
To my knowledge there is no sizing document specific to PAL, but let me ask around a bit.
Thank you for your response and for checking further whether any sizing guideline is available specifically for PAL.
Hi Shankar,
One step back: we interact with relational database management systems (RDBMS) using SQL (Structured Query Language). Often, processing logic is required (IF ... THEN ... ELSE). For this, each RDBMS adds its own procedural language: SQLScript for SAP HANA, T-SQL for Microsoft SQL Server, PL/SQL for Oracle, etc.
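The kind of procedural logic meant here can be sketched in SQLScript like this (procedure and parameter names are illustrative, not from the thread):

```sql
-- Minimal SQLScript sketch of IF ... THEN ... ELSE logic inside SAP HANA.
CREATE PROCEDURE check_threshold (
  IN  in_value  INTEGER,
  OUT out_label NVARCHAR(10)
)
LANGUAGE SQLSCRIPT AS
BEGIN
  IF :in_value > 100 THEN
    out_label := 'HIGH';
  ELSE
    out_label := 'LOW';
  END IF;
END;
```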
Sometimes, processing requires too many resources (i.e. either you have to wait a long time or you have to spend a lot of money on the request). For those scenarios, some RDBMS make it possible to execute compiled code (C/C++). The side effect: a minor bug can crash the database kernel, and production is down. For this reason, external procedures are executed in a separate process. In the case of SAP HANA, we call these Application Function Libraries (AFL). PAL is a subset of AFL.
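To make this concrete, here is a hedged sketch of calling a PAL procedure from the `_SYS_AFL` schema; the input table `MY_DATA` is an illustrative name, and the parameter-table layout follows the standard PAL convention:

```sql
-- PAL parameter table in the usual PAL layout
-- (NAME, INT_VALUE, DOUBLE_VALUE, STRING_VALUE).
CREATE LOCAL TEMPORARY COLUMN TABLE #PAL_PARAMETER_TBL (
  "PARAM_NAME"   NVARCHAR(256),
  "INT_VALUE"    INTEGER,
  "DOUBLE_VALUE" DOUBLE,
  "STRING_VALUE" NVARCHAR(1000)
);
INSERT INTO #PAL_PARAMETER_TBL VALUES ('GROUP_NUMBER', 3, NULL, NULL);

-- MY_DATA: illustrative input table with an ID column plus feature columns.
-- The call runs in the scriptserver process, so a crash in the AFL code
-- does not take down the index server.
CALL _SYS_AFL.PAL_KMEANS (MY_DATA, #PAL_PARAMETER_TBL, ?, ?);
```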
To answer your question:
As documented
See also

For a gentle introduction into the topic of SAP HANA architecture, see

Hi Shankar,
Sizing of a machine learning scenario depends on many factors: algorithms, input data size / type / cardinality, expected performance, concurrency of algorithm invocations, and so on, as you can see from the APL sizing (see here). As PAL offers many more algorithms, it is hard to give a prediction as generic as your inquiry asks for.
There are SAP application scenarios, as well as customer and partner application scenarios, where the PAL (or APL) processing goes almost unnoticed on the system, and others where there is, for example, a monthly one-day peak in forecasting workload that requires significant additional resources.
One approach would be to schedule the PAL workload for times when the system has the required processing capacity available. Alternatively, since I understand you are looking at an on-premise HANA 2.0 installation, such workload could potentially be offloaded to a HANA Cloud instance providing the additional capacity. Furthermore, you can certainly guardrail the PAL/APL processing workload using HANA workload management (e.g. limit the threads available to a PAL invocation).
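Guardrailing via HANA workload management can be sketched as follows; the class name, mapping name, application name, and limit values are illustrative assumptions, not recommendations:

```sql
-- Hedged sketch: cap the threads and memory a PAL-heavy statement may use.
CREATE WORKLOAD CLASS "PAL_BATCH"
  SET 'STATEMENT THREAD LIMIT' = '8',     -- illustrative value
      'STATEMENT MEMORY LIMIT' = '50';    -- in GB, illustrative value

-- Route sessions of a (hypothetical) forecasting job to that class.
CREATE WORKLOAD MAPPING "PAL_BATCH_MAPPING"
  WORKLOAD CLASS "PAL_BATCH"
  SET 'APPLICATION NAME' = 'PAL_FORECAST_JOB';
```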
Inference using the PAL_*_PREDICT functions in supervised learning scenarios (regression, classification) should not impact your sizing, unless you intend to serve a larger number of PAL models in parallel using PAL model state (here).
I recommend starting to prototype your scenario more specifically (regarding data, algorithms, etc.); that would be the basis and input for your sizing efforts.
Best regards,
Christoph
Thank you for your response - I fully understand that multiple factors come into play with Machine Learning Algorithm performance.
Our team seems to have received an SAP guideline earlier to maintain 35% to 50% free memory capacity for our BW on HANA implementation. I wanted to understand whether any such guideline is available specifically for a PAL implementation in terms of overall CPU / memory, apart from the needs that arise from other factors like data volume, usage scenario, etc.
There is a plethora of information about SAP HANA sizing, including (in particular) for SAP BW/4HANA, see
Documentation, tools, specialised training, etc.
For greenfield (completely new) implementations in the (very) early days of SAP HANA, rule-of-thumb T-shirt sizing was not uncommon. But it is always best to size, i.e. prototype, your scenario, as explained by Christoph.
The advice to keep 35-50% of a very expensive machine unused is curious (but makes sense when you are selling hardware, for example).