1 | /*! |
---|
2 | \page devguide_mat BDM Development - your first bdm class in Matlab |
---|
3 | |
---|
4 | bdmtoolbox provides a prodefined points of extensions in directory applications/bdmtoolbox/mex/mex_classes. |
---|
5 | |
---|
6 | The easiest way to extend bdmtoolbox is to use clasess starting with "mex". The simplies class of this type is mexFnc, which just contains name of an arbitrary matlab function to call. |
---|
7 | See \ref ug_pdf_fnc example. |
---|
8 | |
---|
9 | This tutorial assumes you are familiar with basic classes of BDM: pdfs, random variables (\ref userguide_pdf) and estimators (\ref userguide_estim). |
---|
10 | |
---|
11 | Previous part of the tutorial introduced ways how to use existing classes. If no premade class is suitable for your mneeds, you can create your own class in Matlab via predefined interfaces to class epdf (mexEpdf) and BM (mexBM). |
---|
12 | These classes are treated as regular C++ classes in prepared mex files. Creation of a new estimator (offspring of class BM) can be achieved by inheriting from class mexBM. |
---|
13 | |
---|
14 | In practice this means creation of a new file, e.g. my_bm.m, with header: |
---|
15 | \code |
---|
16 | class my_bm < mexBM |
---|
17 | \endcode |
---|
18 | This file must be located in directories known to matlab (must be in its path). This can be any directory if you use addpath command. |
---|
19 | See relevant documentation in Matlab. |
---|
20 | |
---|
21 | \section dev_mat_pdf Creating your own probability density |
---|
22 | |
---|
23 | Creation of your own density is achieved by inheriting from mexEpdf: |
---|
24 | \code |
---|
25 | class my_pdf < mexEpdf |
---|
26 | \endcode |
---|
27 | in a file called my_pdf.m. |
---|
28 | |
---|
29 | Calling methods from mexEpdf will result in an error that tells you which functionality is missing. |
---|
30 | It is necessary to give them a meaning. |
---|
31 | |
---|
32 | Epdf is a class representing probability density function of a multivariate (vector) variable. |
---|
33 | |
---|
34 | The easiest example of a density is dirac Delta function |
---|
35 | \f[ |
---|
36 | f(x_t) = \delta( x_t - \bar{x}_t ) |
---|
37 | \f] |
---|
38 | with properties: |
---|
39 | - \f$ \bar{x}_t \f$ is the statistics of this density. |
---|
40 | - dimension of the density is dimension of the vector \f$ \bar{x}_t \f$ |
---|
41 | - the mean value of the density is \f$ E(x_t) = \bar{x}_t \f$ |
---|
42 | - the variance value of the density is \f$ var(x_t) = zeros() \f$ |
---|
43 | - all samples from this density are equal to \f$ \bar{x}_t \f$ |
---|
44 | |
---|
45 | These properties are encoded in software as follows: |
---|
46 | |
---|
47 | \code |
---|
48 | classdef mexDirac < mexEpdf |
---|
49 | % Dirac delta probability distribution |
---|
50 | properties % DATA structures |
---|
51 | point % point of the support, \bar{x}_t |
---|
52 | end |
---|
53 | methods |
---|
54 | function m=mean(obj) % compute mean values of the density |
---|
55 | m = obj.point; |
---|
56 | end |
---|
57 | function obj=validate(obj) % check if data structures are consistent |
---|
58 | % point should be a column |
---|
59 | if (size(obj.point,2)>1) |
---|
60 | if (size(obj.point,1)==1) % it is row |
---|
61 | obj.point = obj.point'; |
---|
62 | end |
---|
63 | else |
---|
64 | error('Point in mexDirac is not a vector'); |
---|
65 | end |
---|
66 | end |
---|
67 | function dim=dimension(obj) % inform others about your dimensions |
---|
68 | dim = size(obj.point,1); |
---|
69 | end |
---|
70 | function v=variance(obj) % compute variance |
---|
71 | v=zeros(size(obj.point)); |
---|
72 | end |
---|
73 | function l=evallog(obj,x) % return logarithm of your density at point x |
---|
74 | if obj.point==x |
---|
75 | l = inf; |
---|
76 | else |
---|
77 | l=0; |
---|
78 | end |
---|
79 | end |
---|
80 | function s=sample(obj); % return random sample from your density |
---|
81 | s = obj.point; |
---|
82 | end |
---|
83 | end |
---|
84 | end |
---|
85 | \endcode |
---|
86 | |
---|
87 | An additional compulsory function is called \c validate which is called always before use of the class in an algorithm. The task of this function is to detect any inconsistency in the object and give the user an appropriate warning. |
---|
88 | For example, in this case, the function checks if the property point is indeed a vector and not a matrix or other structure. |
---|
89 | |
---|
90 | Once you define the methdos above, this object can be plugged into (almost) an arbitrary algoritm, such as simulator, etc. |
---|
91 | |
---|
92 | For seamless use with other objects, you need to define its random variables - via attribute rv, \ref ug_pdf_create. |
---|
93 | |
---|
94 | For inspiration see example class mexLaplace.m and its use in simulation in tutorial/userguide/epdfds_example.m |
---|
95 | |
---|
96 | \section dev_mat_bm Defining your own Bayesian Estimator (BM) |
---|
97 | If you look into the mexBM.m file you see the basic attributes and default functions. mexBM as such will not work as a valid estimator. |
---|
98 | In order to make it work, you need to re-define at least functions <b>dimensions</b> and <b>bayes</b>, and attribute <b>apost_pdf</b>: |
---|
99 | |
---|
100 | This function announces sizes of random variables to other objects. |
---|
101 | |
---|
102 | These sizes can be fixed for a specific problem, or derived from size of internal objects. E.g. for a Kalman filter, the function would be: |
---|
103 | \code |
---|
104 | function dims=dimensions(p) |
---|
105 | %please fill |
---|
106 | %dims = [size_of_posterior size_of_data size_of_condition] |
---|
107 | dims = [size(A,1), size(C,1), size(B,2)] % |
---|
108 | end |
---|
109 | \endcode |
---|
110 | Which indicates that dimension of \f$x_t\f$ is the rows of matrix A, size of \f$y_t\f$ is the rows of matrix C and the size of \f$u_t\f$ is given by columns of matrix B. |
---|
111 | |
---|
112 | The argument apost_pdf should be a valid object of class mexEpdf, see previous section how to achieve it. |
---|
113 | |
---|
114 | The second function to extend is |
---|
115 | \code |
---|
116 | function obj=bayes(obj,dt,cond) |
---|
117 | % transform old estimate into new estimate |
---|
118 | end |
---|
119 | \endcode |
---|
120 | |
---|
121 | Here, you need to define an update (both time- and data-update) of your Bayesian filter. |
---|
122 | |
---|
123 | Note, that statistics of the posterior are stored in attribute apost_pdf. So, the task of this function is to update fileds in apost_pdf. |
---|
124 | |
---|
125 | Any additional attributes needed for this task shoul be added to properties of the object. |
---|
126 | |
---|
127 | |
---|
128 | \section dev_exa Example task: Laplace estimation |
---|
129 | |
---|
130 | Consider a systems, where data are generated by a Laplace model |
---|
131 | \f[ |
---|
132 | f(y_t|a,b) = \frac{1}{2b} \exp \left( -\frac{x-a}{b} \right) |
---|
133 | \f] |
---|
134 | with unknown time-varying parameters \f$ a \f$ and \f$ b \f$. |
---|
135 | |
---|
136 | We decided to estimate parameters of the system using ML estimates of parameters on a sliding window. The estimates are then: |
---|
137 | \f{eqnarray} |
---|
138 | \hat{a}& =& \mathrm{median}([y_{t},y_{t-1},\ldots , y_{t-d}),\\ |
---|
139 | \hat{b}& = & \frac{1}{d} \sum_{\tau=t-d}^{t} | y_\tau - \hat{a} | |
---|
140 | \f} |
---|
141 | |
---|
142 | Implementing this system in BDM requires the following steps: |
---|
143 | - realize that ML estimation is an extreme approximation of Bayes rule with posterior in the form of Dirac delat pdf, this pdf is ready from the previous step. |
---|
144 | - under this choice, the posterior is a 2-dimensional dirac, which is encoded as: |
---|
145 | \code |
---|
146 | apost_pdf = mexDirac; |
---|
147 | apost_pdf.point = [0;0]; |
---|
148 | \endcode |
---|
149 | - the class mexDirac as introduced has only one attribute: point, which is a vector of the support. If we wish to distinguish the meaning of these values, we use attribute rv. For example, |
---|
150 | \code |
---|
151 | apost_pdf.rv = RV({'a','b'}); |
---|
152 | \endcode |
---|
153 | - the class representing an estimator of this type, needs to remember internal statistics in the form window of data, which will be stored in properties of the class |
---|
154 | |
---|
155 | - dimensionality of the estimator is then: 2-dimensional parameters, 1-dimensional observation, 0-dimensional conditioning variables. It is encoded as: |
---|
156 | \code |
---|
157 | function dims=dimensions(p) |
---|
158 | %please fill |
---|
159 | %dims = [size_of_posterior size_of_data size_of_condition] |
---|
160 | dims = [2,1,0] % |
---|
161 | end |
---|
162 | \endcode |
---|
163 | - the method \c bayes then copies the obtained data into the window and updates estimates \f$ \hat{a}, \hat{b}\f$ |
---|
164 | |
---|
165 | This estimator has been implemented as class mexLaplaceBM as follows: |
---|
166 | |
---|
167 | This is implemented as follows: |
---|
168 | \code |
---|
169 | classdef mexLaplaceBM < mexBM |
---|
170 | % Approximate Bayesian estimator of parameters of Laplace distributed observation. |
---|
171 | % Maximum likelihood approximation of the Bayes rule is used, posterior is in the form of dirac. |
---|
172 | properties |
---|
173 | max_window_length = 10; % max window length (default = 10) |
---|
174 | data_window =[]; % sliding window of data |
---|
175 | end |
---|
176 | methods |
---|
177 | function obj=validate(obj) % prepare all internal objects for use |
---|
178 | obj.apost_pdf = mexDirac; |
---|
179 | obj.apost_pdf.point = [0;0]; |
---|
180 | obj.log_evidence = 0; % evidence is not computed! |
---|
181 | end |
---|
182 | |
---|
183 | function dims=dimensions(obj) |
---|
184 | %please fill: dims = [size_of_posterior size_of_data size_of_condition] |
---|
185 | dims = [2,1,0] % we have: [2d parameters, 1d observations, 0d condition] |
---|
186 | end |
---|
187 | function obj=bayes(obj,dt,cond) % approximate bayes rule |
---|
188 | if size(obj.data_window,2)>=obj.max_window_length |
---|
189 | obj.data_window = [dt obj.data_window(1:end-1)]; |
---|
190 | else |
---|
191 | obj.data_window = [dt obj.data_window]; |
---|
192 | end |
---|
193 | % transform old estimate into new estimate |
---|
194 | m_hat = mean(obj.data_window); |
---|
195 | b_hat = sum(abs(obj.data_window-m_hat))/ size(obj.data_window,2); |
---|
196 | obj.apost_pdf.point = [m_hat; b_hat]; % store result in psoterior pdf |
---|
197 | end |
---|
198 | end |
---|
199 | |
---|
200 | end |
---|
201 | \endcode |
---|
202 | |
---|
203 | \section dev_mat_alt Alternative representations of this estimator |
---|
204 | |
---|
205 | Note, that the estimator above is rather coarse approximation of the Bayes rule. Indeed, in this case, the sliding window represnts a statistics on the posterior parameters, |
---|
206 | and thus it should be part of the posterior, which can thus evaluate more moments that the current Dirac density. |
---|
207 | |
---|
208 | Such a design is certainly possible, however, it would result in much more complex calculations and more complex coding. If that is the aim, a new class can be designed. |
---|
209 | |
---|
210 | \section dev_mat_pred Predictors and simulation |
---|
211 | |
---|
212 | Note, that the Laplacian likelihood model of the example is not needed for its estimation. The very basic estimator introduced above does not need to know about existence of the likelihood function. |
---|
213 | |
---|
214 | However, existence of this distribution is required by related tasks of data simulation and data prediction. |
---|
215 | Therefore, a class mexLaplace was designed, which implements all operations of pdf in the same manner as the in \ref dev_mat_pdf. |
---|
216 | |
---|
217 | This pdf is then used in the mexLaplaceBM in function that creates predictors: |
---|
218 | \code |
---|
219 | function p=epredictor(obj,cond) % when predictive density is needed approximate it by Laplace with point estimates |
---|
220 | % return predictive density (max likelihood) |
---|
221 | p = mexLaplace; |
---|
222 | p.mu = obj.apost_pdf.point(1); |
---|
223 | p.b = obj.apost_pdf.point(2); |
---|
224 | % do not forget to validate |
---|
225 | p=p.validate; |
---|
226 | % assign descriptions |
---|
227 | p.rv = yrv; |
---|
228 | % rvc is empty be default |
---|
229 | end |
---|
230 | \endcode |
---|
231 | |
---|
232 | Also, the class mexLaplace can be used to generate data, see bdmtoolbox/mex/epdfds_example.m |
---|
233 | |
---|
234 | Since it provides all required methods, it can be combined with other pdfs in chain rules, as in pdfds_example.m |
---|
235 | |
---|
236 | Similar extensions to mexEpdf and mexBM can be made to other classes. See library/mex/mex_Epdf.h and library/mex/mex_BM.h how it is done. |
---|
237 | As always, feel free to contact the maintainer for help. |
---|
238 | */ |
---|