
Real Storage Sizes above 2G

This section presents a series of measurements that explore the performance characteristics of z/VM 3.1.0 when running the 64-bit CP in real storage sizes larger than 2 Gigabytes. The approach taken was to hold total storage constant while varying the amount of that storage that is configured as real (central) storage versus expanded storage (see footnote 1). For each measured storage configuration, multiple measurements were done using various minidisk cache (MDC) tuning settings. In this section, all results shown are "tuned MDC" cases where the MDC settings gave good results for that storage configuration. The next section, Minidisk Cache with Large Real Storage, focuses on the performance that results from various MDC tuning strategies.

Two sets of measurements are provided: one with total storage fixed at 8G and one with total storage fixed at 12G. The 8G size is small enough that some DASD paging results. With the 12G size, total storage is large enough that DASD paging is essentially eliminated and all remaining paging, if any, occurs to expanded storage.

All measurements were obtained on the same 2064-1C8 8-way configuration described elsewhere in this report, but with various configurations of real and expanded storage. There were 10,800 CMS1 users, driven by internal TPNS, resulting in an average processor utilization of about 90%. Hardware instrumentation, CP monitor, and TPNS throughput data were collected for each measurement.

For the measured 8-way configuration, the results show that increasing real storage beyond 2G does improve performance, but only by a few percent. This is to be expected because past large system measurements have consistently shown that CP is very efficient at using expanded storage as a place to temporarily put user pages that do not fit into real storage while that user is dormant (thinking) between requests. When the workload is so heavy that the pages needed by actively running users do not all fit into real storage, CP will start forming an eligible list to prevent thrashing between real and expanded storage. It is in that situation that the ability to configure real storage larger than 2G can result in dramatic performance improvements.

Total Storage: 8G

Measurements were obtained in storage configurations ranging from 2G real and 6G expanded (2G/6G) to 8G/0G. The results are summarized in Table 1 and Table 2. Table 1 shows the absolute results, while Table 2 shows the results as ratios relative to the 2G/6G base run.


Table 1. CMS1 with 8G Total Storage


Real Storage                  2G        4G        6G        8G
Expanded Storage              6G        4G        2G        0G
MDC Real                 default      160M      202M      672M
MDC Xstor                   512M      512M      476M        0M
Run ID                  E1228826  E0104845  E0104864  E0104880

Response Time
  TRIV INT                  0.06      0.04      0.04      0.12
  NONTRIV INT               0.56      0.47      0.48      1.18
  TOT INT                   0.13      0.11      0.11      0.31

Throughput
  ETR (T)                1078.09   1090.11   1091.84   1021.59
  ITR (H)                1208.71   1216.00   1235.03   1264.48

Proc. Usage
  PBT/CMD (H)              6.619     6.579     6.478     6.327
  CP/CMD (H)               1.846     1.772     1.656     1.625
  EMUL/CMD (H)             4.773     4.807     4.821     4.702

Processor Util.
  TOTAL (H)               713.55    717.18    707.25    646.33
  UTIL/PROC (H)            89.19     89.65     88.41     80.79
  TOTAL EMUL (H)          514.53    524.02    526.42    480.33
  TOTAL EMUL              530.40    536.80    539.20    493.60
  TVR(H)                    1.39      1.37      1.34      1.35
  TVR                       1.24      1.25      1.26      1.26

Paging
  READS/SEC                  207         0        33       355
  WRITES/SEC                 263        85       125       424
  PAGE/CMD                  0.44      0.08      0.14      0.76
  PAGE IO RATE             52.70      0.30      5.60     76.00
  PAGE IO/CMD               0.05      0.00      0.01      0.07
  XSTOR IN/SEC              6322      3957       526         0
  XSTOR OUT/SEC             6792      4214       661         0
  XSTOR/CMD                12.16      7.50      1.09      0.00

I/O
  RIO RATE                  4635      4620      4622      4387
  RIO/CMD                   4.30      4.24      4.23      4.29
  NONPAGE RIO/CMD           4.25      4.24      4.23      4.22
  DASD RESP TIME             6.7       5.9       5.9      10.8
  MDC REAL SIZE (MB)         160       158       200       670
  MDC XSTOR SIZE (MB)        511       512       476         0
  MDC TOTAL SIZE (MB)        671       670       676       670
  MDC HIT RATIO             96.6      96.2      96.3      96.1

PRIVOPs
  PRIVOP/CMD               57.67     58.43     59.51     56.93
  DIAG/CMD                 78.28     77.72     77.85     76.92

Note: 2064-1C8, 8 processors, 10800 users, internal TPNS, T=TPNS, H=Hardware Monitor, Unmarked=VMPRF


Table 2. CMS1 with 8G Total Storage - Ratios


Real Storage                  2G        4G        6G        8G
Expanded Storage              6G        4G        2G        0G
MDC Real                 default      160M      202M      672M
MDC Xstor                   512M      512M      476M        0M
Run ID                  E1228826  E0104845  E0104864  E0104880

Response Time
  TRIV INT                 1.000     0.772     0.754     2.175
  NONTRIV INT              1.000     0.844     0.851     2.113
  TOT INT                  1.000     0.805     0.803     2.342

Throughput
  ETR (T)                  1.000     1.011     1.013     0.948
  ITR (H)                  1.000     1.006     1.022     1.046

Proc. Usage
  PBT/CMD (H)              1.000     0.994     0.979     0.956
  CP/CMD (H)               1.000     0.960     0.897     0.880
  EMUL/CMD (H)             1.000     1.007     1.010     0.985

Processor Util.
  TOTAL (H)                1.000     1.005     0.991     0.906
  UTIL/PROC (H)            1.000     1.005     0.991     0.906
  TOTAL EMUL (H)           1.000     1.018     1.023     0.934
  TOTAL EMUL               1.000     1.012     1.017     0.931
  TVR(H)                   1.000     0.987     0.969     0.970
  TVR                      1.000     1.008     1.016     1.016

Paging
  READS/SEC                1.000     0.000     0.159     1.715
  WRITES/SEC               1.000     0.323     0.475     1.612
  PAGE/CMD                 1.000     0.179     0.332     1.749
  PAGE IO RATE             1.000     0.006     0.106     1.442
  PAGE IO/CMD              1.000     0.006     0.105     1.522
  XSTOR IN/SEC             1.000     0.626     0.083     0.000
  XSTOR OUT/SEC            1.000     0.620     0.097     0.000
  XSTOR/CMD                1.000     0.616     0.089     0.000

I/O
  RIO RATE                 1.000     0.997     0.997     0.946
  RIO/CMD                  1.000     0.986     0.985     0.999
  NONPAGE RIO/CMD          1.000     0.997     0.995     0.993
  DASD RESP TIME           1.000     0.881     0.881     1.612
  MDC REAL SIZE (MB)       1.000     0.990     1.251     4.193
  MDC XSTOR SIZE (MB)      1.000     1.000     0.930     0.000
  MDC TOTAL SIZE (MB)      1.000     0.999     1.007     0.999
  MDC HIT RATIO            1.000     0.996     0.997     0.995

PRIVOPs
  PRIVOP/CMD               1.000     1.013     1.032     0.987
  DIAG/CMD                 1.000     0.993     0.995     0.983

Note: 2064-1C8, 8 processors, 10800 users, internal TPNS, T=TPNS, H=Hardware Monitor, Unmarked=VMPRF

The best overall performance was achieved in the 6G/2G configuration. Relative to the 2G/6G base measurement, throughput (ETR (T)) was 1.3% better, while processor efficiency, as measured by PBT/CMD (H), was 2.1% better (the ratio is 0.979). The 8G/0G configuration showed even better processor efficiency (a 4.4% improvement relative to the base configuration) but throughput dropped by 5.2%, indicating an increase in external response time (see footnote 2). This finding is consistent with what we have observed in the past for smaller storage configurations. It benefits response time to configure some of the storage as expanded storage because it then serves as a high speed paging device that will tend to contain the most frequently needed pages, thus avoiding in many cases the much longer delay of waiting for pages to be brought in from DASD.
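The percentages quoted above follow directly from the Table 1 values. The short Python sketch below recomputes the ratios and percentage changes for the two configurations discussed; the function and variable names are illustrative only and are not part of any measurement tool used for this report.

```python
# Values copied from Table 1 (8G total storage).
BASE = {"ETR (T)": 1078.09, "PBT/CMD (H)": 6.619}  # 2G/6G base run
RUNS = {
    "6G/2G": {"ETR (T)": 1091.84, "PBT/CMD (H)": 6.478},
    "8G/0G": {"ETR (T)": 1021.59, "PBT/CMD (H)": 6.327},
}


def ratio_to_base(value, base):
    """Ratio of a measured value to the base-run value, to three decimals."""
    return round(value / base, 3)


def percent_change(ratio):
    """Percentage change implied by a ratio (negative means a decrease)."""
    return round((ratio - 1.0) * 100.0, 1)


for config, metrics in RUNS.items():
    for name, value in metrics.items():
        r = ratio_to_base(value, BASE[name])
        print(f"{config} {name}: ratio {r:.3f}, change {percent_change(r):+.1f}%")
```

For PBT/CMD (H), a lower value means less processor time per command, so a negative change is an improvement; for ETR (T), a negative change is a loss of throughput.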

Although the 2G/6G base case ran fine with default real MDC tuning (real MDC size selected by the real storage arbiter, bias of 1), we learned that for real storage sizes greater than 2G it was important to constrain the real MDC size in some way in order to get the best performance. This is discussed further in Minidisk Cache with Large Real Storage. For these measurements, we decided to constrain the real storage MDC by setting it to a fixed size chosen such that the total MDC cache size (real MDC plus expanded MDC) is approximately equal to the size that resulted in the 2G/6G base run. This was done in an effort to eliminate total MDC size as a factor influencing overall performance in this series of measurements.
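As a rough illustration of that sizing rule, the sketch below subtracts the expanded storage MDC size from the total MDC size observed in the 2G/6G base run (671 MB in Table 1). This is an assumed simplification of how such a value could be picked, not the procedure actually used; the function name is hypothetical.

```python
def fixed_real_mdc_mb(base_total_mdc_mb, xstor_mdc_mb):
    """Real MDC size (MB) that keeps real + expanded MDC at the base total."""
    return max(base_total_mdc_mb - xstor_mdc_mb, 0)


BASE_TOTAL_MDC_MB = 671  # MDC TOTAL SIZE (MB) in the 2G/6G base run, Table 1

# Expanded storage MDC sizes for the other three configurations (Table 1).
XSTOR_MDC_MB = {"4G/4G": 512, "6G/2G": 476, "8G/0G": 0}

for config, xstor_mb in XSTOR_MDC_MB.items():
    real_mb = fixed_real_mdc_mb(BASE_TOTAL_MDC_MB, xstor_mb)
    print(f"{config}: real MDC of about {real_mb}M")
```

The settings actually used in the measurements (160M, 202M, and 672M) are close to, but not identical to, these values, which is consistent with the total being only approximately matched.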

Total Storage: 12G

This total storage size is large enough that DASD paging is essentially eliminated. Measurements were obtained in storage configurations ranging from 2G/10G to 12G/0G. The absolute and relative results are summarized in Table 3 and Table 4 respectively.


Table 3. CMS1 with 12G Total Storage


Real Storage                  2G        4G       10G       12G
Expanded Storage             10G        8G        2G        0G
MDC Real                    100M      200M      400M      400M
MDC Xstor                   300M      200M      none      none
Run ID                  E0104822  E010484D  E01048A6  E01048C0

Response Time
  TRIV INT                  0.04      0.04      0.04      0.04
  NONTRIV INT               0.46      0.47      0.46      0.55
  TOT INT                   0.10      0.11      0.10      0.12

Throughput
  ETR (T)                1090.05   1090.58   1093.19   1090.92
  ITR (H)                1217.44   1217.60   1264.59   1263.79

Proc. Usage
  PBT/CMD (H)              6.571     6.570     6.326     6.330
  CP/CMD (H)               1.772     1.763     1.521     1.542
  EMUL/CMD (H)             4.799     4.807     4.805     4.788

Processor Util.
  TOTAL (H)               716.29    716.54    691.57    690.57
  UTIL/PROC (H)            89.54     89.57     86.45     86.32
  TOTAL EMUL (H)          523.14    524.25    525.26    522.34
  TOTAL EMUL              535.20    536.80    538.40    535.20
  TVR(H)                    1.37      1.37      1.32      1.32
  TVR                       1.24      1.26      1.24      1.24

Paging
  READS/SEC                    0         0         0       183
  WRITES/SEC                  80        79        80       260
  PAGE/CMD                  0.07      0.07      0.07      0.41
  PAGE IO RATE              0.00      0.00      0.00     37.30
  PAGE IO/CMD               0.00      0.00      0.00      0.03
  XSTOR IN/SEC              6320      4005        29         0
  XSTOR OUT/SEC             6638      4301        34         0
  XSTOR/CMD                11.89      7.62      0.06      0.00

I/O
  RIO RATE                  4612      4718      4595      4623
  RIO/CMD                   4.23      4.33      4.20      4.24
  NONPAGE RIO/CMD           4.23      4.33      4.20      4.20
  DASD RESP TIME             5.7       6.1       5.8       6.5
  MDC REAL SIZE (MB)         100       199       398       398
  MDC XSTOR SIZE (MB)        299       200         0         0
  MDC TOTAL SIZE (MB)        399       399       398       398
  MDC HIT RATIO             96.3      95.3      96.5      96.5

PRIVOPs
  PRIVOP/CMD               58.20     58.62     59.63     59.36
  DIAG/CMD                 77.68     77.81     77.79     77.81

Note: 2064-1C8, 8 processors, 10800 users, internal TPNS, T=TPNS, H=Hardware Monitor, Unmarked=VMPRF


Table 4. CMS1 with 12G Total Storage - Ratios


Real Storage                  2G        4G       10G       12G
Expanded Storage             10G        8G        2G        0G
MDC Real                    100M      200M      400M      400M
MDC Xstor                   300M      200M      none      none
Run ID                  E0104822  E010484D  E01048A6  E01048C0

Response Time
  TRIV INT                 1.000     1.023     0.953     1.023
  NONTRIV INT              1.000     1.033     1.004     1.190
  TOT INT                  1.000     1.030     0.986     1.138

Throughput
  ETR (T)                  1.000     1.000     1.003     1.001
  ITR (H)                  1.000     1.000     1.039     1.038

Proc. Usage
  PBT/CMD (H)              1.000     1.000     0.963     0.963
  CP/CMD (H)               1.000     0.995     0.859     0.870
  EMUL/CMD (H)             1.000     1.002     1.001     0.998

Processor Util.
  TOTAL (H)                1.000     1.000     0.965     0.964
  UTIL/PROC (H)            1.000     1.000     0.965     0.964
  TOTAL EMUL (H)           1.000     1.002     1.004     0.998
  TOTAL EMUL               1.000     1.003     1.006     1.000
  TVR(H)                   1.000     0.998     0.962     0.966
  TVR                      1.000     1.016     1.000     1.000

Paging
  WRITES/SEC               1.000     0.988     1.000     3.250
  PAGE/CMD                 1.000     0.987     0.997     5.533
  XSTOR IN/SEC             1.000     0.634     0.005     0.000
  XSTOR OUT/SEC            1.000     0.648     0.005     0.000
  XSTOR/CMD                1.000     0.641     0.005     0.000

I/O
  RIO RATE                 1.000     1.023     0.996     1.002
  RIO/CMD                  1.000     1.022     0.993     1.002
  NONPAGE RIO/CMD          1.000     1.022     0.993     0.994
  DASD RESP TIME           1.000     1.070     1.018     1.140
  MDC REAL SIZE (MB)       1.000     1.994     3.997     3.997
  MDC XSTOR SIZE (MB)      1.000     0.668     0.000     0.000
  MDC TOTAL SIZE (MB)      1.000     1.000     0.997     0.997
  MDC HIT RATIO            1.000     0.990     1.002     1.002

PRIVOPs
  PRIVOP/CMD               1.000     1.007     1.025     1.020
  DIAG/CMD                 1.000     1.002     1.001     1.002

Note: 2064-1C8, 8 processors, 10800 users, internal TPNS, T=TPNS, H=Hardware Monitor, Unmarked=VMPRF

With 12G total storage, the 10G/2G configuration showed the best overall performance, although 12G/0G performed just about as well. Unlike what we saw with 8G total storage, the case with no expanded storage (12G/0G) did not experience any appreciable throughput decrease relative to the configurations that have expanded storage. This is because 12G is large enough that, for this workload, there is little DASD paging to cause response time delays.

For this measurement series, we chose to keep total MDC size constant at 400M so as to minimize the effects of minidisk cache size changes on the performance results. We found that 400M was more than sufficient and resulted in an excellent MDC hit ratio (around 96%).


Footnotes:

1
Storage can still be defined as real or expanded storage on the zSeries 900 processors and z/VM continues to support expanded storage.

2
External response time data requires TPNS logging, which was disabled for these measurements as a simplification. However, the throughput measured by TPNS (ETR (T)) is a good indicator of what is happening to external response time. This is because throughput per user is 1 command / (think time + response time). Average think time is constant for this workload and, as a result, TPNS throughput varies inversely with average external response time.
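The relationship described in this footnote can be turned around to estimate how much external response time changed between two runs. The Python sketch below does that for the 2G/6G and 8G/0G runs in Table 1. It assumes that ETR (T) is expressed in commands per second and that all 10,800 users are continuously cycling through think time and response time; the function name is illustrative.

```python
USERS = 10_800  # number of CMS1 users in these measurements


def cycle_time_seconds(etr_cmds_per_sec, users=USERS):
    """Average (think time + response time) per command for one user.

    Assumes ETR is in commands per second, so per-user throughput is
    ETR / users and the cycle time is its reciprocal.
    """
    return users / etr_cmds_per_sec


base_cycle = cycle_time_seconds(1078.09)      # 2G/6G, ETR (T) from Table 1
no_xstor_cycle = cycle_time_seconds(1021.59)  # 8G/0G, ETR (T) from Table 1

# Think time is constant for this workload, so the difference in cycle
# time is attributable to external response time.
response_time_increase = no_xstor_cycle - base_cycle
print(f"Estimated increase in external response time: "
      f"{response_time_increase:.2f} seconds")
```

Under these assumptions, the 5.2% throughput drop in the 8G/0G run corresponds to an increase in average external response time of roughly half a second.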
