CMB Data Analysis
Maximum Likelihood Analysis
Given an Np-pixel map of the sky temperature, Δp, and a measure of the pixel-pixel correlations in the noise, Npq, we want to find the most likely underlying cosmological model and parameters that would produce the signal observed in the map. For a given model and set of parameters we calculate the associated pixel-pixel correlations in the CMB signal, Spq. Assuming that the signal and the noise are uncorrelated, the pixel-pixel correlations in the map, Mpq, are simply the sum of those in the signal and the noise: Mpq = Spq + Npq. Assuming that the CMB fluctuations are Gaussian, the likelihood of the particular model and parameters is the multivariate Gaussian given below.
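In standard notation (the usual Gaussian pixel-domain likelihood, with M = S + N):

  \mathcal{L}(\Delta \mid S, N) \;=\; (2\pi)^{-N_p/2}\, \left| S + N \right|^{-1/2} \exp\!\left[ -\tfrac{1}{2}\, \Delta^{T} \left( S + N \right)^{-1} \Delta \right]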
Our goal is to find the best-estimated set of cosmological parameters, that is, those parameters which maximize the likelihood function for that class of cosmological models. A short paper outlining the implementation of algorithms for locating the peak of the likelihood function for a general sky temperature map can be found here.
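The quadratic estimator referred to in the Timing section below is, in its usual form (assuming the standard Newton-Raphson iteration toward the likelihood peak; the details used here may differ), the parameter update

  \delta a_b \;=\; \tfrac{1}{2} \sum_{b'} \left(F^{-1}\right)_{bb'}\, \mathrm{Tr}\!\left[ \left( \Delta \Delta^{T} - M \right) M^{-1} \frac{\partial S}{\partial a_{b'}} M^{-1} \right],
  \qquad
  F_{bb'} \;=\; \tfrac{1}{2}\, \mathrm{Tr}\!\left[ M^{-1} \frac{\partial S}{\partial a_b}\, M^{-1} \frac{\partial S}{\partial a_{b'}} \right]

with M = S + N. Each iteration manipulates M and the Nb derivative matrices ∂S/∂a_b (plus their products with the inverse of M), which is roughly where the (2 x Nb + 2) matrix count and the Np^3 operation scaling quoted below come from.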
Timing
For a map with Np pixels and a target cosmological model with Nb parameters to be determined, the quadratic estimator algorithm requires (these scalings are evaluated in the sketch after the lists below):
- (2 x Nb + 2) x Np^2 x 4 bytes of disc storage.
- 2 x Np^2 x 8 bytes of RAM.
- (2 x Nb + 2/3) x Np^3 floating point operations.
assuming that
- all the necessary matrices are simultaneously stored on disc in single (4-byte) precision.
- matrices are loaded into memory no more than two at a time in double (8-byte) precision.
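As a minimal sketch of these scalings (the helper function and the assumed sustained flop rate for a 600 MHz workstation are illustrative, not taken from this page), the following gives order-of-magnitude agreement with the MAXIMA-1 row of the first table:

# Resource requirements for one iteration of the quadratic estimator,
# given Np pixels (n_pix) and Nb parameters (n_par).
def quadratic_estimator_cost(n_pix, n_par, flops_per_sec=6.0e8):
    """Return (disk_bytes, ram_bytes, flops, serial_seconds).

    flops_per_sec is an assumed sustained serial rate; ~6e8 is a rough
    guess for a 600 MHz workstation sustaining about one flop per cycle.
    """
    disk_bytes = (2 * n_par + 2) * n_pix**2 * 4   # all matrices on disk, 4-byte precision
    ram_bytes = 2 * n_pix**2 * 8                  # two matrices in memory, 8-byte precision
    flops = (2 * n_par + 2.0 / 3.0) * n_pix**3    # dense Np x Np matrix operations dominate
    return disk_bytes, ram_bytes, flops, flops / flops_per_sec

if __name__ == "__main__":
    # MAXIMA-1-like case: 32,000 pixels, 10 parameters.
    disk, ram, flops, secs = quadratic_estimator_cost(32_000, 10)
    print(f"disk ~{disk / 1e9:.0f} GB, RAM ~{ram / 1e9:.0f} GB, "
          f"~{flops:.1e} flops, ~{secs / 86400:.0f} days serial")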
If the cosmological model has 10 parameters, then the computational
requirements for the current MAXIMA and BOOMERanG balloon experiments for
a single iteration of the algorithm on (i) a 600 MHz workstation and (ii)
the NERSC Cray T3E-900 (using the specified number of processors and running
at 2/3 peak) are
Dataset | Map Size (pixels) | Disk | RAM | Flops | Serial CPU Time | T3E Time
BOOMERanG N. America | 26,000 | 55 GB | 11 GB | 3.6 x 10^14 | 7 days | 2 hours (x 64)
MAXIMA-1 | 32,000 | 85 GB | 17 GB | 0.6 x 10^15 | 12 days | 5.6 hours (x 64)
MAXIMA-2 | 80,000 * | 0.5 TB | 100 GB | 1 x 10^16 | 7 months | 9 hours (x 512)
BOOMERanG Antarctica | 450,000 * | 15 TB | 3 TB | 2 x 10^18 | 100 years | 70 days (x 512)
(* projected)
If we project further and consider MAP and Planck data
sets, we find even larger numbers.
Assuming that the cosmological model has 20 parameters (some cosmological and some astrophysical/experimental), the computational requirements for the MAP and Planck missions for a single iteration of the algorithm on (i) a 600 MHz workstation and (ii) the NERSC Cray T3E-900 (using the specified number of processors and running at 2/3 peak) are
Dataset | Map Size (pixels) | Disk | RAM | Flops | Serial CPU Time | T3E Time
MAP (single frequency) | 10^6 | 160 TB | 16 TB | 4 x 10^19 | 2 x 10^3 years | 4 years (x 512)
MAP | 10^5-10^6 | 1.6-10^3 TB | 0.1-16 TB | 4 x 10^22 | 2 x 10^1-10^4 years | 0.4-40 years (x 512)
PLANCK (LFI) | 10^6-10^7 | 10^2-10^3 TB | 16-1600 TB | 4 x 10^24 | 2 x 10^6 years | 200 years (x 1024)
PLANCK (HFI) | 10^7 | 10^4 TB | 1600 TB | 4 x 10^24 | 2 x 10^6 years | 200 years (x 1024)
Note that for a single map containing Np pixels, the correlation matrix is Np by Np and requires 4 x Np^2 bytes of storage in single precision without exploiting symmetry (or the same in double precision using the symmetry). Thus the correlation matrix for a single map with Np = 10^6-10^7 will be 4-400 terabytes in size.
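Spelled out:

  4 N_p^2 \ \text{bytes} \;=\; 4 \times (10^6)^2 \ \text{bytes} \;=\; 4\ \text{TB}
  \qquad\text{to}\qquad
  4 \times (10^7)^2 \ \text{bytes} \;=\; 400\ \text{TB}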