Kernel density estimation for particles distributed over a 2-dimensional space is calculated using a single graphical processing unit (GTX 660Ti GPU) and CUDA-C language. Parallel calculations are done for particles having bivariate normal distribution and by assigning calculations for equally-spaced node points to each scalar processor in the GPU. The number of particles, blocks and threads are varied to identify favorable configuration. Comparisons are obtained by performing the same calculation using 1, 2 and 4 processors on a 3.0 GHz CPU using MPICH 2.0 routines. Speedups attained with the GPU are in the range of 88 to 349 times compared the multiprocessor CPU. Blocks of 128 threads are found to be the optimum configuration for this case.

1.
A.
Baklanov
,
A.
Mahura
,
J. H.
Sorensen
,
O.
Rigina
, and
R.
Bergman
, “Methodology for Risk Analysis based on Atmospheric Dispersion Modeling from Nuclear Risk Site,”
Danish Meteorological Institute–Scientific Report
,
Copenhagen, Denmark
,
2002
.
2.
A.
Stohl
,
C.
Forster
,
A.
Frank
,
P.
Seibert
, and
G.
Wotawa
,
Atmos. Chem. Phys.
,
5
,
2461
2474
(
2005
).
3.
F.
Molnar
Jr.
,
T.
Szakaly
,
R.
Meszaros
, and
L.
Lagzi
,
Computer Physics Communication
181
,
105
112
(
2010
).
4.
M.
Uliasz
, Lagrangian Particle Dispersion Modeling in Mesoscale Applications, Environmental Modeling II, edited by
P.
Zannetti
(
Computational Mechanics Publication
,
Southampton, Boston
,
1994
), pp.
71
102
.
5.
J.
Jimenez
and
J.
Ruiz de Miras
,
Computer Methods and Programs in Biomedicine
108
,
1229
1242
(
2012
).
6.
G.
Kalantzis
and
H.
Tachibana
,
Computer Methods and Programs in Biomedicine
113
,
116
125
(
2014
).
This content is only available via PDF.
You do not currently have access to this content.