In the results below, the starting sector appears in the "machine name" column.
Version 1.96 ------Sequential Output------ --Sequential Input- --Random- Concurrency 1 -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 63 8G 28564 6 7040 1 10828 0 165.9 3 Latency 1235ms 719ms 87156us 824ms 64 8G 29969 6 15526 2 44117 2 200.5 3 Latency 1126ms 460ms 24127us 764ms 65 8G 28594 6 7067 1 11186 0 173.9 2 Latency 1171ms 433ms 719ms 805ms 72 8G 30110 6 15475 2 44907 2 201.5 3 Latency 1169ms 712ms 21640us 795ms Generated with a command such as `bonnie++ -m65 -d /mnt -f -n 0 | tee -a bonnie.log`
Conclusions: Sequential writes are unaffected, but random writes and sequential reads are significantly affected by having the correct alignment. I was surprised by the large effect on sequential reads. 8 sectors is the right factor; no more is required.
Version 1.97 ------Sequential Output------ --Sequential Input- --Random- Concurrency 1 -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 4096s 31920M 134096 11 39904 5 174552 9 223.8 4 Latency 174ms 459ms 124ms 5340ms 4095s 31920M 111141 10 25646 3 165830 8 171.4 3 Latency 362ms 309ms 380ms 7463ms 1.97,1.97,4096,1,1444665581,31920M,,,,134096,11,39904,5,,,174552,9,223.8,4,,,,,,,,,,,,,,,,,,,174ms,459ms,,124ms,5340ms,,,,,, 1.97,1.97,4095,1,1444703284,31920M,,,,111141,10,25646,3,,,165830,8,171.4,3,,,,,,,,,,,,,,,,,,,362ms,309ms,,380ms,7463ms,,,,,,