Technology Blogs by Members
Explore a vibrant mix of technical expertise, industry insights, and tech buzz in member blogs covering SAP products, technology, and events. Get in the mix!
Showing results for 
Search instead for 
Did you mean: 
This is an utility that worked very well in converting  HTML to Jupyter Notebook IPYNB with all the Author's text for 3 SAP blogs I tried

Will be very helpful for many blogs with the current thrust in Data Science and Data Engineering; reader wishes to try but copy paste painful
Without lots of comments (read markdown) only code is almost useless!

Did a lot of searching and found NOTHING that met my needs
Nearest was
His notebook is in

I converted that to adapted for SAP Blogs where code is
inside pre tags as you can see in the HTML files

marsja,py works but gives a notebook with only code cells;
to my mind not too helpful; uses beautifulsoup and lxml packages

Please head to my repository

My gives exactly what most people need
A python notebook with lots of markup

I used the excellent package html2text which you need to install
pip install html2text
Documentation in

2nd package you do not need to install is py2nb
Wonderful compact but delivered as a python script
I had to copy paste in my program
Have informed Author about the 3 Issues that compelled me to copy

I ran 4 commands

# APL1 Hands-On Tutorial: Automated Predictive (APL) in SAP HANA Cloud
python "" APL1

# PAL1 Hands-On Tutorial: Leverage SAP HANA Machine Learning in the Cloud through the Predictive Analysis Library
# Author has CODE as images
# He has provided the ipynb from github
python "" PAL1

# APL2 Multiclass Classification with APL (Automated Predictive Library)
python "" APL2

# APL2 as bare code by
python "" APL2

The output files are in my repository
You should examine at least APL1 if you wish to use and adapt

Github has excellent jupyter notebook rendition
See these
# output of

# output of ONLY CODE no FUN!

# executed with editing just user ML_USER and connection MYHANACLOUD

my saved connection is MYHANACLOUD and saved user ML_USER

Not fortunate enough to have Cloud BTP access and I have a P-ID
so I used HANA EXPRESS in my personal docker

I hope many use this utility which I wrote definitely for my self

For external notebooks where HTML is not as "nice" as SAP Blogs
you can adapt the python program by looking at the HTML.txt
Find how the code cells are organized in the HTML
Skill in Python REGEX will help a lot

Labels in this area