
The Data Analysis Tool is a html+javascript based utility used by administrators, tuning experts, support engineers and consultants to create professional graphs based various types of input data.
The tool is fully automatic and significantly faster than Excel at creating scatterplots, bar charts and line plots together with histogram analysis for almost any column separated input data such as NMON (CPU_ALL, MEM, MEM_NEW, PROC, LPAR, PAGE), VMSTAT, SDFMON, TAANA, ABAPMETER, MHTML, NIPING, NFSIOSTAT and XLSX.
DataAnalysis.HTML will run in all modern web browsers like Chrome, Edge, Firefox, Opera, Safari (Windows / Linux / MacOS / iPadOS).
Even very large files of 100MB like NMON, NIPING, SDFMON can be opened in a few seconds.
Technically Data Analysis is a HTML page with Javascript which can open local files. The data is then visualized using the open-source JavaScript charting library plotly.js v2.35.2 - (C)opyright 2012-2019, Plotly, Inc. Licensed under the MIT license (view the source on GitHub.).
The purpose of this document is providing some additional screenshots and details explaining the usage of Data Analysis.
Data Analysis can only handle local files and does not allow any direct connection to any database. The data to be analyzed must be downloaded beforehand to a local PC.
Data.Analysis.offline.v15.14.zip can be downloaded via SAP Note 3169320 - Data Analysis
Documentation
The full documentation of Data Analysis is attached to SAP Note 3169320 - Data Analysis
Data.Analysis.HTML allows to open and concatenate multiple input files, it will detect automatically the column separator (tab, semicolon, comma..) - any character not included in 0..9, A..Z, a..z or space will be detected. To load data into the Data Analysis tool you have three options:
If files are too big to load you can try to remove white spaces using the following command sequence before opening the file with Data Analysis:
1. open power shell, then execute command bash
2. execute command: tr -d '[:blank:]' < inputfile > outputfile (for a 2GB file this should take around 1min, which is faster than most editors)
On the second screen one can review the data (if required specify the date format) and edit the column headers if required or go directly to the graphic display.
The last line in the table is indicating what type of data has been detected and how many distinct values exist in each column - the format is <type> / <number>
- text value
d date
t time
n numeric
The user has some options to manipulate the input data (the buttons are only available if the user is putting the cursor into the header line of a column)
In some occasions the program might not be able to determine the date format automatically, in that case the user can choose from different date formats.
Additionally, the user can download the data (tab separated text file with extension .xls). To reduce the amount of data, columns which do not contain significant information (all values identical) can be removed. If files are too big then the user can also remove every 2nd data row.
Below the various options to customize the graphic output
Additional Options
On the graphic screen one can select multiple fields for the x-axis and y-axis, specify the number of points, colors, scaling and color coding intervals for the histogram. Both the x and y axis can be scaled interactively. Additional functionality is available if you hover the mouse over the graphic.
|
Available graph options are:
![]() | ![]() | ![]() |
Scatter plot | Bar chart | Line chart |
![]() | ![]() | ![]() |
Multi Line Charts | Area Charts | Stacked Area Charts |
![]() | ![]() | ![]() |
Grouped Line Charts (SDFMON per Date) | Stacked Bar Charts (SDFMON per Appl.Server) | Stacked Area Charts SDFMON per Appl.Server) |
Some data sources (NIPING, SDFMON...) might contain more data points than can be drawn on a chart without losing information because the points/lines/bars will overlap each other. Therefore, one can limit the number of points (by default this limit is set to 10000). If one wants to display more data points in the scatterplot you can specify the point size as shown in the example below
![]() | ![]() |
1000 data points each 5pt | 30000 data points each 3pt |
Line and bar charts with too many points cannot resolve properly - to smooth the data one can specify the number of data points for which and average should be calculated.
![]() | ![]() | ![]() |
Bar Chart (120 Data Points 12min average) | Area Chart (5 min average with 288 data points) | Line Chart (30 sec. average with 2880 data points) |
Depending on the file type, record length, number of columns the Data Analysis tool can load and parse approximately 140.000 records per second. The throughput of the graphic engine depends on the number of operations and fields but typically Data Analysis can calculate and draw around 15.000 measurement points per second. We can estimate the average total processing time T of the Data Analysis tool by T (ms) = R/1000 * 7 + P/1000 * 60 where R is the total number of records loaded, P the number of data points in the graphic.
Example(s):
Browser Performance:
There are minor differences between the various browsers (Safari, Chrome, Firefox, Opera, Edge) - on MacOS Safari is not recommended when loading more than 3GB of data. Since MacOS BigSur Safari has problems with memory usage - seems that GC does not work as good as in the other chromium-based browsers. However, for smaller files Safari is faster.
nmon Files: When very large amount of big nmon files need to be analyzed, consider using nmon compressor which can remove data records which are not needed for the analysis. The tool nmon compressor can achieve a data reduction of up to 80-90% resulting in significantly faster loading times with data analysis or nmon analyzer.
Performance load test on MacOS BigSur with 172 nmon files (661 MB with total of 247000 records and 10.000 data points in the graphic), Memory test with 207 nmon files with a total size of 7.1 GB
Browser | Load Time (sec) | Graphic refresh (ms) | Peak Memory usage (loading 7.1 GB of nmon files) |
Safari | 12.3 | 1146 | > 8.2 GB (crashed during loading because of high memory usage) |
Chrome | 14.2 | 1379 | 4.36 GB all files loaded in 62 seconds |
Edge | 14.6 | 1301 | 4.56 GB all files loaded in 66 seconds |
Opera | 14.8 | 1578 | 4.26 GB all files loaded in 66 seconds |
Firefox | 13.3 | 1281 | 5.06 GB all files loaded in 108 seconds |
The memory usage of Data Analysis tool depends on the file size and number of data points to be displayed. When we open a SDFMON file with 1.586.920 records the initial memory consumption reached 1GB of memory. During the creation of the graphic the total memory consumption of the browser (opera) increased to 11.5 GB of memory drawing all 1.586.920 data points (runtime 75 seconds). When we limit the number of data points to 200.000 the memory consumption reached only 2.1GB. The memory consumption scales with the number of data points drawn in the graphic and can be estimated at about 5-10 kB per data point.
Hope you find the blog helpful. The intention of this blog is to give an overview of the Data Analysis tool which can save a lot of time generating a graphic representation of performance data collected by SAP administrators especially when multiple files need to be processed.
While Data Analysis does not intend or claim to replace other professional tools like Excel, it offers a free alternative to visualize data, especially if the data structure itself is not in a tabular format like nmon, niping, trex... which would otherwise require a complex and time-consuming pre-processing and parsing.
Stay tuned for further enhancements and updates.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
13 | |
11 | |
8 | |
6 | |
6 | |
6 | |
5 | |
5 | |
5 | |
5 |