Context Navigation

unit_testing.dox @ 544

Revision 543, 8.1 kB (checked in by vbarta, 16 years ago)
added unit test documentation

Line
1	/*!
2	\page unit_testing Unit testing of BDM
3
4	Unit tests are simple, fast and easy to run tests of small pieces of
5	code (the "units") designed to reassure developers that individual
6	parts of the developped functionality behave as they're meant to. BDM
7	being an object-oriented library, the independent units are classes,
8	tested by calling their individual methods and comparing actual to
9	expected results.
10
11	Unit tests are especially useful for C++, where the simplest change
12	can introduce (or uncover) a difficult-to-find bug. They also
13	facilitate changing the internal structure of classes without
14	modifying their external behavior - unit tests can be run before and
15	after such a change to make sure the behavior had indeed stayed the
16	same. More ambitiously, tests can be written before implementing new
17	functionality, serving as its machine-checkable specification. Some
18	authorities go so far as to demand that "not a single line of code" be
19	written before having a failing unit test for it first, but BDM unit
20	testing isn't that demanding - it does facilitate the style for
21	developers who prefer it, but most BDM code is tested retroactively,
22	with new tests suggested by various considerations (see below).
23
24	\section Setup
25
26	Unit tests of BDM use a 3rd-party unit testing framework, UnitTest++
27	(from http://unittest-cpp.sourceforge.net/ ), which is designed
28	specifically for C++, reasonably popular and maintained (for a C++
29	unit testing framework), works on Unix, Windows and Mac, allows simple
30	test addition and proved to be readily extensible. Sources of
31	UnitTest++ are included in directory library/tests/unittest-cpp, so
32	that they can be built as part of the normal BDM build, without the
33	need for explicit installation and setup, and are lightly modified for
34	specific requirements of BDM.
35
36	Existing unit tests are collected into a testsuite executable (so that
37	they can all easily be run together), whose sources are in
38	library/tests. Apart from the top-level testsuite.cpp, testsuite
39	sources are generally named <classname>_test.cpp, where
40	<classname> is the class tested by the file (or a base class of
41	classes tested by the file, when these derived classes are similar
42	enough to be tested in the same way). BDM doesn't test abstract
43	classes using mock-ups (although it does test some template
44	instantiations on test-only classes) - all its abstract classes
45	already have concrete children and methods of the base class can be
46	tested on them.
47
48	When run, testsuite prints to standard output the names of performed
49	tests and error messages when they fail, which means, among other
50	things, that it should generally be run from the command line - even
51	on Windows. Note that while UnitTest++ does have support for more
52	elaborate GUIs, this support hasn't been extended for BDM-specific use
53	cases. Testsuite can also be run under debugging tools - not only
54	under a debugger, but also valgrind for checking memory management,
55	coverage tools etc.
56
57	Specific tests can be run individually, by passing their name (or
58	names) as a command-line argument to testsuite. Note that a test which
59	behaves differently depending on which tests run before it is a bug in
60	testsuite (this can happen e.g. when using RVs, which are global -
61	it's probably best to include the tested class name in their names,
62	although not all existing tests do that yet). Tests of sampling and
63	other functionality depending on random inputs can occasionally fail.
64	Whether that's due to skewed inputs, BDM bugs or too tight error
65	limits is not clear. Testsuite aims for every test passing with at
66	least 95% probability, but the quantification of error limits isn't
67	finished yet.
68
69	Not all tests depending on UnitTest++ are in testsuite. Performance
70	experiments have their own executables - currently just
71	square_mat_stress, for testing BDM matrix wrappers. square_mat_stress
72	is more configurable than testsuite: it doesn't generate its own
73	random data, but reads them from a configuration file (named
74	agenda.cfg by default) generated by another executable,
75	square_mat_prep. This allows generation (and checking) of test inputs
76	for various numerically demanding scenarios, which are encapsulated in
77	square_mat_prep's generators - that is, classes implementing its
78	generator interface. generators themselves are configurable using the
79	same scripting framework as other BDM classes, and both
80	square_mat_prep and square_mat_stress have command-line parameters -
81	run them with '-?' to see the list. Analogical support can be created
82	for other classes which would benefit from it. Focus of stress tests
83	is rather different from unit tests as generally understood - that a
84	class has stress tests doesn't mean it shouldn't be tested separately
85	inside testsuite.
86
87	\section new_unit_tests Adding new unit tests
88
89	Adding new unit tests is encouraged - it's a good way to get
90	acquainted with the library, and there's always space for more
91	tests.
92
93	Unit tests are implemented by adding a block of code (it actually
94	isn't just a function body, but it looks like one) starting with the
95	TEST macro, whose argument is the test name:
96
97	\code
98	TEST ( test_egiw ) {
99	epdf_harness::test_config ( "egiw.cfg" );
100	}
101	\endcode
102
103	"UnitTest++.h" should be included in every test file to have the macro
104	defined.
105
106	Tests are usually named test_<classname>, or
107	test_<classname>_postfix when a class has more than one test:
108
109	\code
110	TEST ( test_egiw_1_2 ) {
111	\endcode
112
113	This is useful when a class should be thoroughly tested from multiple
114	aspects, e.g. for different dimensions. Note that each test in
115	testsuite must have a unique name, which must be a legal C++
116	identifier.
117
118	Testsuite has explicit support for testing classes derived from
119	bdm::epdf and bdm::mpdf: bdm::epdf_harness and
120	bdm::mpdf_harness. These classes run a list of tests on objects
121	created from the specified configuration file (normally called
122	<classname>.cfg) and check them against expected values also
123	specified in that file, so that a test using harness is just a
124	single-line call to its test_config method (as above).
125
126	Unit tests can also be written explicitly. The test body is normal C++
127	code, implementing the checks of actual against expected values using
128	various CHECK_* macros defined by UnitTest++:
129
130	\code
131	double summ = 0.0;
132	for ( int k = 0; k < n; k++ ) { // ALL b
133	...
134	}
135
136	CHECK_CLOSE ( 1.0, summ, 0.1 );
137	\endcode
138
139	Note that when checking vectors or matrices, you should also include
140	"mat_checks.h", which defines their comparison. Unit tests shouldn't
141	require any user interaction and probably shouldn't output anything
142	other than error reports via the UnitTest++ facilities - among other
143	things, UnitTest++ uses C functions for its output, and therefore a
144	test printing via std::cout and std::cerr may report in rather
145	confusing mixed order.
146
147	Developers wishing to add new tests can use various heuristics to come
148	up with the specific functionality to test - a selection is described
149	below, roughly in the order of usefulness.
150
151	\subsection existing_bugs Eliciting existing bugs
152
153	When the library has a bug, it may be useful to isolate the smallest
154	possible code snippet which exhibits it - if not always, then at
155	least as reliably as possible. Such a snippet is a good candidate for
156	a unit test - it can be added either before the bug is fixed, or
157	afterwards.
158
159	\subsection tricky_parts Testing the code known to be tricky
160
161	Every body of code has its dark corners - copy&pasted legacy code,
162	highly optimized algorithms, complicated interactions. These should be
163	tested first.
164
165	\subsection unclear_behavior Documenting and stabilizing unclear behavior
166
167	The library can behave in ways surprising its users without being
168	necessarily "wrong". In such cases, it's most important that all
169	maintainers agree on what the correct behavior should be (so that
170	attempts to fix it don't run in circles). Unit tests constraining the
171	contested functionality communicate very efficiently what their author
172	thinks the behavior should (or shouldn't) be.
173
174	\subsection coverage Increasing test coverage
175
176	Ideally, unit tests should test every method of every class (there are
177	higher ideals, but they're even less realistic). Failing that, at
178	least some methods of every class should be tested. Coverage tools
179	(e.g. gcov) can show which parts of the library are not exercised by
180	testsuite.
181
182	*/

Note: See TracBrowser for help on using the browser.

Download in other formats:

Original Format