cancel
Showing results for 
Search instead for 
Did you mean: 

Sci-Kit learn library in SAP Datasphere Data Flow

Moritz_F
Explorer
0 Kudos
139

Hi everyone,

I tried to add Python code into a Sap Datasphere Data Flow utilizing the sci-kit learn library. However I always get errors. Does anyone know if the library is available in the Dataflow Script Console or what alteratives I would have here?

Many thanks,

Moritz

Accepted Solutions (0)

Answers (2)

Answers (2)

AndreasForster
Product and Topic Expert
Product and Topic Expert
0 Kudos

Hi @Moritz_F, the Python operator in a Datasphere Data Flow supports only the Python packages listed here. Further packages cannot be installed. Please also consider that the Python operator receives the data in packages. You are not guaranteed to get all data at once. Hence I see two options:

  1. Consider whether your scikit-learn logic can be implemented using the HANA Machine Learning, that is part of Datasphere's HANA Cloud. Here is an example https://community.sap.com/t5/artificial-intelligence-and-machine-learning-blogs/hands-on-tutorial-ma...
  2. Alternatively you would need to extract the data to a Python environment (ie in BTP: Jupyter Notebooks in Business Application Studio or Python in Cloud Foundry) and write any output back to Datasphere
Githen_Ronney
Explorer
0 Kudos

Hi Moritz,

Numpy and Pandas libraries are supported in script component of Datasphere Dataflow.

https://help.sap.com/docs/SAP_DATASPHERE/c8a54ee704e94e15926551293243fd1d/73e8ba1a69cd4eeba722b458a2...

Regards

Githen