This is a Wiki; login to edit. If you don’t have a login or you don’t have enough privileges, e-mail Curt.
In an effort to cross validate the various black hole parameter estimation codes that have been developed the following exercise has been suggested:
The first step is to adopt the MLDC conventions for the LISA orbit. A brief version of the MLDC conventions document can be found here. The noise model to use is available in the form of a c-code Noise.c, which uses the header file Cnst.h. The LISA orientation parameters kappa and lambda should both be set to their MLDC default values of zero and the low frequency cut-off should be set at 1e-5 Hz. It is also very important that all the codes use the same values for the physical constants found in Cnst.h.
Four test examples are provided in the files mbh1a.txt, mbh1b.txt, mbh2a.txt, mbh2b.txt. The “a” set is for one year of observations and a time to merger of one year. The “b” set is for the same systems, but with time to merger of 1.05 years. The “b” sets should be helpful since the results will not depend on how the waveforms are truncated at the ISCO.
For the purposes of the comparison exercise both the amplitude and the phase should be computed to 2PN order.
The masses are given in solar mass units, and correspond to the physical masses. The redshifted masses are (1+z) times larger.
The sky location is given in terms of the ecliptic co-latitude θs and longitude Φs. The ecliptic latitude, lats = π/2 - θs is also quoted to remind people to be careful about this convention. The orbital orientation is given in terms of co-latitude θl and longitude Φl. The orientation is also given in terms of inclination ι and polarization angle ψ using the standard MLDC conventions. The quantity φc is the ORBITAL phase at coallesence. All angles are quoted in radians.
tar -zxvf file.tar), you will find six files corresponding to the waveforms in the time and frequency domain for the detectors I and II. In the time domain, the strain is real (ampl*cos(phase)), so the 1st column corresponds to the time values and the 2nd one to the strain. In the frequency domain (computed analytically using the Stationary Phase Approximation) the 1st column corresponds to the frequency values, the 2nd for the real part and the 3rd one for the imaginary part. We have also added two files (*_allharm.dat) separating the contribution of different harmonics, so 1st col. correspond to frequency values, 2nd & 3rd to real and imaginary part of j=1 harmonic, 4th & 5th (j=2), ..., 12nd & 13rd (j=6) and 14th & 15th (total). In our original parameter estimation code we only worked with the waveform in the frequency domain (computed analyticaly using SPA); we modified a little bit that code in order to also obtain the waveform in the time domain, which is computed directly from Blanchet expressions, NOT as an invFT of the other ones. See comments and plots comparing our results with Cornish & Hughes’ ones in their section (now everything matches pretty well between CH and ST, we were wrong computing the waveform in the time domain and now we also have included all the harmonics in the frequency domain waveform).The MLDC version of the Barack-Cutler kludge waveforms provide a good starting point for the comparison of EMRI parameter estimation codes. The most useful data sets for this purpose are the Challenge 1.3.x and 1B.3.x training data, as the training data sets come with noise-free versions of the waveforms. These noise free waveforms are very useful for checking codes. The MLDC EMRI waveforms are fully documented here.
The EMRI search code developed by MtGWA group has been used to perform Fisher and MCMC studies of the Challenge 1.3.x training data. The following link is copied from the MLDC taskforce wiki:
* 2007/5/3: First look at EMRI posterior distributions for 1.3 training data EMRI notes
In those notes the explicit values of the Fisher matrix uncertainties were not listed. Using the key file parameter values as input, here are the Fisher uncertainties for Training 1.3.1. ( See revised results from 2008/4/8 below )
For reference, the fitting factor for the MtGWA EMRI code (which uses the low frequency approximation to the LISA response and interpolated waveforms to cut template generation times to less than a second) is 0.9993 for the 1.3.1 training data.
* 2008/4/8: It looks like the numerical central differencing offset of 1e-6 used in the Fisher computation may have been too large for some of the parameters. This quantity has been reduced to 1e-9 (consistent results have been found for 1e-10 and 1e-11), and the Fisher matrices have been recomputed. Here are the results for MLDC training 1.3.x: Training 1.3.1, Training 1.3.2, Training 1.3.3, Training 1.3.4, Training 1.3.5. It is worth noting that these values do not agree with the MCMC results (at least for 1.3.1), so more investigation is needed. The MCMC runs are currently being repeated.
* 2008/4/10: The problems with the Fisher estimates continue. I have increased the number of samples in the waveform generation and gone over to 4th order accurate numerical differencing (up from 2nd order accurate). The problems have not gone away. Over a range of numerical differencing parameters epsilon, the entries in the Fisher matrix are fairly stable - at most ten percent differences in the values. But this is enough to change the error estimates by orders of magnitude. Note that only the entries involving the mass, spin, eccentricity and lambda angle are significantly affected. The extrinsic parameters are stable. Here are some results from the training 1.3.1 case using the 4th order accurate numerical differencing epsilons of 1e-8 and 1e-9. Note that the Fisher matrices Fisher matrix 1e-8 and Fisher matrix 1e-9 differ by at most 6% in any entry, while the parameter errors Sigmas e-8 and Sigmas e-9 differ by orders of magnitude in some of the parameters. The reason for this can be traced to the large condition number of the Fisher matrices, which makes the inversions very sensitive to small changes.
* 2008/8/28: PROBLEM SOLVED. Motivated by today’s telecon, I returned to the EMRI Fisher matrix problem and found a simple solution: The eigenvalues of the Fisher matrix come in two sets, one set of 5-6 has very large eigenvalues (generally related to the intrinsic parameters), while the rest have very small eigenvalues. Some time ago I tried just deleting the small eigenvalue terms (in other words, using an SVD approach), and while that helped with the intrinsic parameters, it messed up everything else. The eventual fix is very simple: when computing the errors in the intrinsic parameters, delete the small eigenvalue contributions. When computing the errors in the extrinsic parameters, keep all the eigenvalues. This completely fixed the instability problem, and gives results in excellent agreement with the new MCMC runs. See the attached graph for a comparison of the Fisher matrix prediction in blue and the MCMC derived posterior in red for MLDC source 1B.3.1.
17/1/2008: We also have a Monte Carlo search code which we have been using for the MLDC. This employs an approximate waveform model, which is based on Barack and Cutler but with some simplifications to speed up waveform computation. As a first investigation, we are computing posteriors for the Round 1.3 training data sets. We have preliminary results that use a normalized waveform search code, i.e., with maximization over distance rather than including distance as a search parameter. Results from this code for training set 1.3.2 are here.
We are now debugging a search code that uses un-normalized waveforms, and should have results from this for all of the Round 1.3 training sets within two weeks.
13/2/2008: We now have results that can be directly compared to those of Neil Cornish, i.e., these are distributions over the full parameter space, and use parameters at t=0 rather than parameters at plunge. At present we still have results for 1.3.2 only, but are running the code for the remaining four data sets now. Results for training set 1.3.2 are here.
28/2/2008: We now have results for the other four training data sets, computed in the same way as the 1.3.2 results described above. A document summarizing results for all five training data sets is available here.
09/04/2008: We have carried out parameter estimation using the start and plunge parameter Fisher matrices for all of the 1.3 training data sets. We find that the calculation of our Fisher matrix is extremely sensitive to the parameter offset. A document detailing our parameter error and convergence tests is available :here. We need to run some MCMCs to further investigate the Fisher matrix for each challenge data set.
22/05/2008: We have tried to carry out dumb MCMCs using uniform priors from the most optimistic/pessimistic error estimates from our previous analysis. We confined ourselves to the Challenge 1.3.2 and 1.3.4 training data sets. We found that for 1.3.2 our pessimistic estimate was too large by orders of magnitude, and our most optimistic estimate was too small by a factor of 3-5. For 1.3.4 we found that the most optimistic error estimate was too small by over an order of magnitude. We have now calculated the FIM as the second derivative of the expectation value of the log likelihood. While this FIM is more convergent, it still looks to have issues of ill-conditioning. Furthermore it is expensive to compute so its use is limited. A document describing this can be found :here.
2/10/2008: We have done some investigations to test Neil’s suggestion that using the full Fisher Matrix to estimate extrinsic and phase parameter errors, while using SVD to compute intrinsic parameter errors might yield robust and accurate results. A document describing these results can be found here. We find results that are consistent with Neil’s suggestion for sources 1.3.1-1.3.3, but it is not so clear for 1.3.4-1.3.5. We are now looking at only the intrinsic-intrinsic and extrinsic-extrinsic sub Fisher Matrices to see if these yield useful estimates. As yet, we have not compared our results to the numerically computed posteriors obtained earlier.
These are preliminary notes on black hole formation models by Marta and Emanuele, including some references and topics for discussion.
Preliminary results on spin evolution due to mergers and accretion are presented here. This is a short paper summarizing the main results.
Here are corrected histograms obtained by running Scott/Neil’s MBHB PE code (including both higher harmonics and spin precession) on Marta’s model universes. (A previous version had failed to convert from dN/dV to dN/dt.) The document now shows 2 cases, corresponding to high and low spins. 3/26/08: Actually it turns out even the above results are a little off. Due to a typo in an email and one data file that was not completely read, the parameter distributions for the above results are a little bit off. By eye they looked fine. In any case, those problems were fixed before the following results re two descope options were generated.
Here are MBH param estimation results that were generated in response to a request by John Morse (head of Astrophysics at NASA) that the LISA Project investigate the cost/science loss of descope options. The U.S. LIST fixed on the following 3 noise curves–baseline, descope1, and descope2. For the MBH part of the report, I used the following 3 source distribution models generated by Marta Volonteri. Results from running Scott/Neil’s code on these 3 distributions for the 3 LISA configs are summarized in this MBHB results document. I should note that this MBH work was part of a much larger effort, especially by Neil Cornish and Jon Gair, to look at the effect of descopes on Galactic binaries, EMRIs, and MBHBs.
Update: May 19th, 2008. Marta asked about the parameter distribution of detectable sources. This document gives histograms of the physical parameter distribution for sources with total (A+E) SNR >10, for the baseline noise and the iso-eff source distribution. [Note that by mistake an earlier version of this document plotted the redshifted masses. The histograms now plot the locally-measured masses.] Also, Miquel asked about how angular resolution and distance resolution depend on source redshift. That’s shown in this accuracy versus distance,(updated June 13) which is again just for the iso-eff distribution, and 2 noise curves: baseline and descope1. I can make more plots like this, but there are hundreds of plots one COULD make, so want to know which ones people want the most.
Results document for the Proceedings, Oct.’08 Curt is putting results from running Scott/Neil’s code on Marta’s 4 models here(results). For the Proceedings, we agreed to only consider 2 noise curves: “baseline” and ‘6link’, where these designations have the same meaning as in the current working version of the LISA ScRD. The input distribtions–i.e., distributions of masses, spins, redshifts,etc.–for Marta’s four models are here(input distributions).
Here is the final draft of the proceedings. This is the source file in tgz.
Please email you acknowledgments (if missing), comments and corrections to (berti at wugrav dot wustl dot edu)
If you need the CQG style files, they are in this tgz file.