Opened 23 months ago

Closed 20 months ago

Last modified 19 months ago

#3177 closed Task (fixed)

QA: Add benchmarking system to test speed of different parts of Piwik in different situations

Reported by:  capedfuzz          Owned by:
Priority:     normal             Milestone:  1.x - Piwik 1.x
Component:    Core               Keywords:
Cc:                              Sensitive:  no

Description

When optimizing or doing major refactoring, it's necessary to run performance tests to make sure new changes are as fast as or faster than the unchanged code. There should be a system in place to allow this sort of testing.

Some ideas on how benchmarking should be done:

  • Should reuse as much of the testing infrastructure as possible (see the sketch after this list).
  • Since most benchmarks will require a large amount of data and thus will take a long time to run, the benchmarks should not be run with the tests.
  • If possible, the benchmarks should communicate with the browser when the test data has finished being added. (I think this would have to be done via forking & IPC.)
  • Allow the number of sites/visits/goals/etc. to be configured (via the browser) for each benchmark.
  • If possible, find some way to quickly load test data.
  • Integrate some or all benchmarks w/ Jenkins. They shouldn't be run w/ the other tests, but it should be possible to run them through Jenkins and get some performance analytics.
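To make the idea concrete, here is a rough sketch of the kind of timing harness such a system could provide on top of the test infrastructure. All names are illustrative and the setup/body callables are placeholders, not an actual implementation:

    <?php
    // Illustrative sketch only: a tiny timing harness a benchmark could reuse
    // on top of the existing test infrastructure. The setup and body callables
    // are placeholders for real fixture/archiving code.
    function runBenchmark($name, $setUp, $body)
    {
        call_user_func($setUp);              // insert sites/visits; excluded from timing
        $start   = microtime(true);
        call_user_func($body);               // the code being measured, e.g. archiving
        $elapsed = microtime(true) - $start;
        printf("%s: %.2fs\n", $name, $elapsed);
        return $elapsed;
    }

    // Usage (placeholders for the real fixture and archiving calls):
    runBenchmark(
        'one site, 10,000 page views, one day',
        function () { /* add test visits here */ },
        function () { /* trigger archiving here */ }
    );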

Attachments (7)

3177.diff.tar.gz (4.2 KB) - added by capedfuzz 23 months ago.
Patch for this issue.
screenshot.pdf (66.3 KB) - added by capedfuzz 23 months ago.
Screenshot of patch.
screenshot.png (90.5 KB) - added by capedfuzz 23 months ago.
Real screenshot. Ignore the other one.
3177.diff.tar.2.gz (7.3 KB) - added by capedfuzz 22 months ago.
New patch. Improved speed for adding visits.
3290.diff.tar.gz (4.4 KB) - added by capedfuzz 20 months ago.
benchmarking w/ phpunit
3290.diff.tar.2.gz (5.0 KB) - added by capedfuzz 20 months ago.
Another patch for phpunit benchmarking.
generate.coffee (15.1 KB) - added by capedfuzz 20 months ago.
Ugly CoffeeScript script used to generate visits.


Change History (38)

Changed 23 months ago by capedfuzz (diosmosis)

Patch for this issue.

Changed 23 months ago by capedfuzz (diosmosis)

Screenshot of patch.

Changed 23 months ago by capedfuzz (diosmosis)

Real screenshot. Ignore the other one.

comment:1 Changed 23 months ago by capedfuzz (diosmosis)

Added a patch for this issue. It contains a system for benchmarking and the following benchmarks:

  • Testing archiving for one site w/ 10,000 page views in one day.
  • Testing archiving for one site w/ 10,000 page views over an entire year.
  • Testing archiving for 1000 sites w/ 12 page views each in one day.
  • Testing archiving for 1000 sites w/ 12 page views each over an entire year.
  • Testing tracking of 1000 page views.

I didn't test all of them since they can take up to ~40 mins to set up, but here are the results of the ones I did test:

  • Tracking 1000 page views: 242s
  • One website w/ 10,000 page views in one day: ~12.5s
  • 1000 websites w/ 12 page views, all in one day: ~178.6s

Some other notes:

comment:2 Changed 23 months ago by matt (mattab)

  • Milestone set to 1.x - Piwik 1.x

comment:3 follow-up: Changed 22 months ago by matt (mattab)

Really interesting concept -- it has some value as it is, since it allows manually comparing Before and After a particular change. It would be VERY nice to run it on Jenkins as part of the broader plan in: #2000

For example we could create a new build on http://qa.piwik.org:8080/jenkins/ called "Performance testing", that would run the performance tests and report the time spent in each test. Ideally we would reuse some tool that would let us draw graphs to see how the performance is evolving across releases maybe?

Changed 22 months ago by capedfuzz (diosmosis)

New patch. Improved speed for adding visits.

comment:4 in reply to: ↑ 3 Changed 22 months ago by capedfuzz (diosmosis)

I added a new patch for this issue w/ improved speed for adding visits. It essentially bypasses cURL for adding visits. I've added a check for the useLocalTracking query param in the integration tests so the improved speed can be used when running tests. For me, running all_tests.php?useLocalTracking=true sped things up from ~640s to ~470s.
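Roughly, the idea looks like this (a hypothetical illustration only -- the patch's LocalTracker class may expose a different API; the direct Piwik_Tracker call mirrors what piwik.php itself does):

    <?php
    // Hypothetical sketch, not the actual LocalTracker code from the patch.
    // Instead of one cURL round-trip per tracked page view, the request is
    // handed straight to the in-process tracker code.
    function trackPageView($params, $useLocalTracking)
    {
        if ($useLocalTracking) {
            // Bypass HTTP entirely: populate the request environment and
            // invoke the tracker directly, roughly as piwik.php does.
            $_GET = $params;
            $tracker = new Piwik_Tracker();
            $tracker->main();
        } else {
            // Original path: an HTTP request per page view via cURL.
            $ch = curl_init('http://localhost/piwik.php?' . http_build_query($params));
            curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
            curl_exec($ch);
            curl_close($ch);
        }
    }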

Let me know what you think.

Replying to matt:

Really interesting concept -- it has some value as it is, since it allows manually comparing Before and After a particular change. It would be VERY nice to run it on Jenkins as part of the broader plan in: #2000

For example we could create a new build on http://qa.piwik.org:8080/jenkins/ called "Performance testing", that would run the performance tests and report the time spent in each test. Ideally we would reuse some tool that would let us draw graphs to see how the performance is evolving across releases maybe?

Do you think it would be a good idea to store the results in the piwik_log_profiling table? Then the graphs could be made from the data there. Or maybe we could create a Jenkins plugin...

comment:5 Changed 21 months ago by matt (mattab)

Code review

  • Good refactoring & first commit of Performance Regression Testing framework: Well Done!! :)
  • setTestEnv -> setTestingEnvironment()

Do you think it would be a good idea to store the results in the piwik_log_profiling table? Then the graphs could be made from the data there.

For sure, if the table is good enough then please reuse it, that would be great! We could then write the performance report as a Piwik plugin (and even export it as HTML via the report publisher #3118 :)

You can commit it as it is, but we definitely need some kind of reporting in Jenkins or on a webpage to keep an eye on performance :) Not sure what the best or easiest way would be?

comment:6 Changed 21 months ago by capedfuzz (diosmosis)

(In [6533]) Refs #3177, added benchmarking system and set of benchmarks to test Piwik performance.

Notes:

  • Added LocalTracker tracker that can be used w/ tests & benchmarks to speed things up.
  • Modified HTTP header sending code to use intermediate utility function.

comment:7 Changed 21 months ago by capedfuzz (diosmosis)

I committed the benchmarking system; however, there's still more to do. Specifically, the benchmarks are still pretty slow (though manageable), and of course, there needs to be some way to integrate w/ Jenkins. Here are my ideas:

Speeding Things Up

I think instead of running setup each time the benchmark is run, we can run it once in a database named after the test case, then keep the database (ie, don't drop it) so the setup method won't have to be run again.
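Something along these lines (a hypothetical sketch; the database naming and the fixture callable are placeholders, not the actual patch):

    <?php
    // Hypothetical sketch of the "keep the fixture database" idea.
    function ensureBenchmarkDatabase(PDO $pdo, $benchmarkName, $setUpFixture)
    {
        $dbName = 'piwik_benchmark_' . preg_replace('/[^a-zA-Z0-9_]/', '_', $benchmarkName);

        $stmt = $pdo->prepare(
            'SELECT SCHEMA_NAME FROM INFORMATION_SCHEMA.SCHEMATA WHERE SCHEMA_NAME = ?');
        $stmt->execute(array($dbName));

        if ($stmt->fetch() === false) {
            // First run: create the database and load the fixture data once.
            $pdo->exec('CREATE DATABASE `' . $dbName . '`');
            $pdo->exec('USE `' . $dbName . '`');
            call_user_func($setUpFixture);
        } else {
            // Later runs: reuse the data that is already there and skip setup.
            $pdo->exec('USE `' . $dbName . '`');
        }
        return $dbName;
    }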

Integrating With Jenkins

I've looked at some plugins for Jenkins and found two that might be useful for tracking performance:

Both of these process XML to get performance data. The benchmarks could be set up to output this XML when certain query parameters are used, and, if possible, Jenkins could get this data and feed it to the plugin. However, I have no experience setting up/configuring Jenkins, so I don't know what this will entail, or whether it's possible at all.
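For illustration, the output could be JUnit-style XML, which Jenkins tooling commonly consumes (a sketch only; the exact schema the chosen plugin expects would need to be confirmed, and the names/numbers below are placeholders):

    <?php
    // Illustrative only: emit benchmark timings as JUnit-style XML.
    function writeBenchmarkXml(array $results, $outputFile)
    {
        $suite = new SimpleXMLElement('<testsuite name="Piwik benchmarks"/>');
        foreach ($results as $benchmarkName => $seconds) {
            $case = $suite->addChild('testcase');
            $case->addAttribute('name', $benchmarkName);
            $case->addAttribute('time', sprintf('%.3f', $seconds));
        }
        $suite->asXML($outputFile);
    }

    // e.g. writeBenchmarkXml(array('OneSiteTwelveThousandPageViews_Year' => 123.4), 'benchmarks.xml');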

What do you think?

comment:8 Changed 21 months ago by capedfuzz (diosmosis)

(In [6534]) Refs #3177, fixing build.

comment:9 Changed 21 months ago by matt (mattab)

Code review

  • Why is the change in Url.test.php required to add the port? I'd prefer it if the test did not add the port when it is 80 by default, as it might create confusing URLs with ports for users?

I get errors: Warning: require_once(piwik\svn\trunk/trunk/tests/benchmarks/OneSiteTwelveThousandPageViews_Year.benchmark.php) [function.require-once]: failed to open stream: No such file or directory in piwik\svn\trunk\tests\benchmark_runner.php on line 18
Does it work for you?

  • Are the tests enabled by default? Since there is no reporting at the moment, please check that they are disabled and do not slow down the build. Until we have some reports on the time spent in each test, the size of the DB, and graphs to see the evolution across builds, let's leave them disabled - ok?

I think instead of running setup each time the benchmark is run, we can run it once in a database named after the test case, then keep the database (ie, don't drop it) so the setup method won't have to be run again.

I think setup/teardown will take relatively little time compared to running the test itself? I think it's OK as it is (deleting/recreating the tables)

Both of these process XML to get performance data. The benchmarks could be set up to output this XML when certain query parameters are used, and, if possible, Jenkins could get this data and feed it to the plugin. However, I have no experience setting up/configuring Jenkins, so I don't know what this will entail, or whether it's possible at all.


Looks nice - Would this Jenkins plugin store the value for each build automatically?

Could we plot simple graphs like "measure time to run benchmark" for each benchmark + total benchmark time, using this plugin?

comment:10 Changed 21 months ago by capedfuzz (diosmosis)

(In [6535]) Refs #3177, fix require bug when loading single benchmark group.

comment:11 Changed 21 months ago by capedfuzz (diosmosis)

Replying to matt:

Code review

  • Why is the change in Url.test.php required to add the port? I'd prefer it if the test did not add the port when it is 80 by default, as it might create confusing URLs with ports for users?

The Url.test.php changes are needed because adding a query parameter (like useLocalTracking=true) to all_tests.php will cause some checks to fail. The port change was for Jenkins, since it doesn't use port 80.

I get errors: Warning: require_once(piwik\svn\trunk/trunk/tests/benchmarks/OneSiteTwelveThousandPageViews_Year.benchmark.php) [function.require-once]: failed to open stream: No such file or directory in piwik\svn\trunk\tests\benchmark_runner.php on line 18
Does it work for you?

Fixed the bug. BTW, if you load benchmark_runner.php directly, it'll show every benchmark.

  • Are the tests enabled by default? Since there is no reporting at the moment, please check that they are disabled and do not slow down the build. Until we have some reports on the time spent in each test, the size of the DB, and graphs to see the evolution across builds, let's leave them disabled - ok?

Benchmarks will not get loaded when running tests since the files don't end w/ .test.php. If they did, Jenkins would never complete a build :).

I think instead of running setup each time the benchmark is run, we can run it once in a database named after the test case, then keep the database (ie, don't drop it) so the setup method won't have to be run again.

I think setup/teardown will take relatively little time compared to running the test itself? I think it's OK as it is (deleting/recreating the tables)

The tests themselves, for the ones I ran, take ~1-4 mins. The setup takes between 7-11 mins when using useLocalTracking=true. W/o local tracking, it can take > 40 mins.

Both of these process XML to get performance data. The benchmarks could be set up to output this XML when certain query parameters are used, and, if possible, Jenkins could get this data and feed it to the plugin. However, I have no experience setting up/configuring Jenkins, so I don't know what this will entail, or whether it's possible at all.


Looks nice - Would this Jenkins plugin store the value for each build automatically?

Could we plot simple graphs like "measure time to run benchmark" for each benchmark + total benchmark time, using this plugin?

Not sure. I was hoping someone w/ more experience w/ Jenkins would be able to more easily figure that out.

comment:12 follow-up: Changed 21 months ago by vipsoft (robocoder)

From what I understand of what is being said, I don't believe the port change is necessary. Jenkins runs in a Jetty container under port 8080. On the CI server, we run php-cgi through Apache at port 80.

comment:13 in reply to: ↑ 12 ; follow-up: Changed 21 months ago by capedfuzz (diosmosis)

Replying to vipsoft:

From what I understand of what is being said, I don't believe the port change is necessary. Jenkins runs in a Jetty container under port 8080. On the CI server, we run php-cgi through Apache at port 80.

Sorry, I wasn't clear enough, by 'port change' I meant 'change to Url.test.php that deals w/ the port in the current url'. No ports were actually changed :)

comment:14 in reply to: ↑ 13 ; follow-up: Changed 21 months ago by matt (mattab)

Replying to capedfuzz:

Replying to vipsoft:

From what I understand of what is being said, I don't believe the port change is necessary. Jenkins runs in a Jetty container under port 8080. On the CI server, we run php-cgi through Apache at port 80.

Sorry, I wasn't clear enough, by 'port change' I meant 'change to Url.test.php that deals w/ the port in the current url'. No ports were actually changed :)

Maybe the question is: Why is this change now necessary?

@vipsoft, do you have any experience with Jenkins plugins and performance reporting?

comment:15 in reply to: ↑ 14 Changed 21 months ago by capedfuzz (diosmosis)

Replying to matt:

Replying to capedfuzz:

Replying to vipsoft:

From what I understand of what is being said, I don't believe the port change is necessary. Jenkins runs in a Jetty container under port 8080. On the CI server, we run php-cgi through Apache at port 80.

Sorry, I wasn't clear enough, by 'port change' I meant 'change to Url.test.php that deals w/ the port in the current url'. No ports were actually changed :)

Maybe the question is: Why is this change now necessary?

The change to Url.test.php is in [6533]. Originally it tested getCurrentUrl() == getCurrentUrlWithoutQueryString(). If you use a query parameter like useLocalTracking=true, however, the test will fail. I modified it to use parse_url manually, so we can use useLocalTracking=true on all_tests.php (to speed things up).

What I didn't do was make sure the port in the current URL was added to the test's expected URL, which caused the test to fail in Jenkins, since it uses port 8080. So I committed a fix in [6534].
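For reference, the idea is roughly the following (a reconstruction for illustration, not the committed diff):

    <?php
    // Build the expected current URL from parse_url() pieces so that extra
    // query parameters like useLocalTracking=true, and non-standard ports
    // such as Jenkins' 8080, don't make the assertion fail.
    $parsed = parse_url(Piwik_Url::getCurrentUrl());

    $expected = $parsed['scheme'] . '://' . $parsed['host'];
    if (isset($parsed['port'])) {
        $expected .= ':' . $parsed['port'];   // the piece missed in the first commit
    }
    $expected .= $parsed['path'];

    // $expected should now equal Piwik_Url::getCurrentUrlWithoutQueryString(),
    // regardless of the query string or the port the server runs on.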

comment:16 follow-up: Changed 21 months ago by matt (mattab)

Thanks @capedfuzz for the clarification.

Do you think this ticket should be closed?
We could open a new one for the dev of the Jenkins Cross Build plugin.

comment:17 in reply to: ↑ 16 Changed 21 months ago by capedfuzz (diosmosis)

Replying to matt:

Thanks @capedfuzz for the clarification.

Do you think this ticket should be closed?
We could open a new one for the dev of the Jenkins Cross Build plugin.

There's still the issue that it's a bit too slow to run a lot of benchmarks. I can make using persisted DB data an option (ie, not dropping the test data). Also, I still have to try w/ the database dump.

But other than that, the issue looks done.

comment:18 Changed 21 months ago by matt (mattab)

See Follow up ticket: #3280 to write some kind of reports and graphs to get Performance across several builds, and see if and when the software becomes faster/slower.

comment:19 Changed 21 months ago by capedfuzz (diosmosis)

(In [6580]) Refs #3177, fix small bug in benchmark runner.

comment:20 follow-up: Changed 21 months ago by SteveG (sgiehl)

The benchmark tests are using the simpletest library, right? I'm currently working on migrating all tests to PHPUnit. Most core and plugin tests are already migrated, some integration tests are still missing. The tests using simpletest will be removed when I'm done. Is there a way to migrate the performance tests to phpunit or something else?

comment:21 in reply to: ↑ 20 Changed 21 months ago by capedfuzz (diosmosis)

Replying to SteveG:

The benchmark tests are using the simpletest library, right? I'm currently working on migrating all tests to PHPUnit. Most core and plugin tests are already migrated, some integration tests are still missing. The tests using simpletest will be removed when I'm done. Is there a way to migrate the performance tests to phpunit or something else?

For the most part they depend on the Integration base test case class. Simpletest itself isn't really used. The only other requirements are that:

  • loading a benchmark file in the browser should not run the tests, but load benchmark_runner.php
  • benchmark_runner.php should work (should only need to change the requires at the top of the file)

I haven't been keeping up w/ the PHPUnit changes, though. Do you think this would still be possible?

Changed 20 months ago by capedfuzz (diosmosis)

benchmarking w/ phpunit

comment:22 Changed 20 months ago by capedfuzz (diosmosis)

I've uploaded a patch that implements benchmarking w/ the PHPUnit test infrastructure. I've modified visualphpunit to show the total execution time of the executed tests and made it possible to set GLOBAL variables before running a test through the browser. To run a benchmark, you select it and set the PIWIK_BENCHMARK_FIXTURE global variable. Then you run the test and it'll tell you how long it took.
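Roughly, the fixture selection works like this (a hypothetical sketch; the benchmark class and the fixture's interface are illustrative, not the exact patch):

    <?php
    // Hypothetical sketch: a PHPUnit benchmark picking its fixture from the
    // globals set through visualphpunit.
    class ArchivingBenchmark extends PHPUnit_Framework_TestCase
    {
        public static function setUpBeforeClass()
        {
            $fixtureClass = isset($GLOBALS['PIWIK_BENCHMARK_FIXTURE'])
                ? $GLOBALS['PIWIK_BENCHMARK_FIXTURE']
                : 'OneSiteTwelveThousandPageViewsOneDay';

            $fixture = new $fixtureClass();
            $fixture->setUp();   // inserts sites/visits (possibly into a persisted DB)
        }

        public function testArchiving()
        {
            // The benchmark body; PHPUnit/visualphpunit reports the time it takes.
        }
    }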

Some notes on this implementation:

  • I've separated the type of visits from the test type. So to run the ArchiveProcessing benchmark for one site w/ 12000 visits, you'd set PIWIK_BENCHMARK_FIXTURE to 'OneSiteTwelveThousandPageViewsOneDay'. If you want to run it for 12000 sites, over a year, you'd use a different fixture type.
  • I've made it possible to save the test data into a separate database so the setup doesn't have to be run more than once.

It's a bit preliminary, and I don't think it's quite working (running the archiving process w/ 12000 visits takes .02s...). Matt, what do you think of my patch? And Stefan, do you see a problem w/ the way I've implemented it (in terms of using PHPUnit)?

Changed 20 months ago by capedfuzz (diosmosis)

Another patch for phpunit benchmarking.

comment:23 Changed 20 months ago by capedfuzz (diosmosis)

I uploaded a new patch. It's been tested and provides several more fixture types and a benchmark for the Tracker. The SqlDump fixture uses a hardcoded URL, but there's currently no publicly hosted SQL dump to use, so it will have to be changed.

Let me know what you think of it.

comment:24 Changed 20 months ago by capedfuzz (diosmosis)

Another note about my patch:

To run a benchmark, go into visualphpunit and select a benchmark to run.

Then in the 'GLOBALS' section on the left panel, enter two key-value pairs:

'PIWIK_BENCHMARK_FIXTURE' => the name of the fixture to use, such as 'OneSiteTwelveThousandVisitsOneDay'

(optional) 'PIWIK_BENCHMARK_DATABASE' => the name of the database to permanently store data in. If not set, the fixture setup will be run every time, which will slow things down if you run benchmarks more than once.

Then run the test. Be sure to only run one benchmark at a time.

I think after I commit, I'll have to edit the README.

comment:25 Changed 20 months ago by matt (mattab)

Looks good to me, nice work!

I would just suggest removing some fixtures and keeping the 3 most useful ones?

Also as per email, please prepare a comprehensive DB dump for the test case for #3330

Changed 20 months ago by capedfuzz (diosmosis)

Ugly CoffeeScript script used to generate visits.

comment:26 Changed 20 months ago by capedfuzz (diosmosis)

  • Resolution set to fixed
  • Status changed from new to closed

(In [6954]) Fixes #3177, added benchmarking system that uses phpunit and visualphpunit.

comment:27 Changed 20 months ago by capedfuzz (diosmosis)

(In [6955]) Refs #3177, set svn property.

comment:28 Changed 19 months ago by capedfuzz (diosmosis)

(In [7027]) Refs #3290, #3177 added XHProf support to VisualPHPUnit. Can now profile tests & benchmarks through browser.
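For context, profiling at the XHProf extension level looks roughly like this (a minimal sketch assuming the xhprof extension is loaded; the output path is just an example, not how VisualPHPUnit stores its runs):

    <?php
    // Minimal XHProf sketch: profile a benchmark/test body and save the run.
    xhprof_enable(XHPROF_FLAGS_CPU | XHPROF_FLAGS_MEMORY);

    // ... run the benchmark or test body here ...

    $profilerData = xhprof_disable();

    // Persist the run so it can be inspected later, e.g. through the XHProf UI.
    file_put_contents(sys_get_temp_dir() . '/piwik_benchmark.xhprof',
                      serialize($profilerData));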

comment:29 Changed 19 months ago by capedfuzz (diosmosis)

(In [7028]) Refs #3177, #3290 add XHProf to tests/lib and tweak visualphpunit display.

comment:30 Changed 19 months ago by capedfuzz (diosmosis)

(In [7029]) Refs #3177, add README section for benchmarking system.

comment:31 Changed 19 months ago by capedfuzz (diosmosis)

(In [7031]) Refs #3177, remove some unnecessary XHProf files and add a section to the tests README describing how to use XHProf.
