FINESTION-2003-2006 a complete dataset of routine air ion measurements 2003 – 2006 Hannes.Tammet@ut.ee in co-operation with: Urmas Hõrrak, Kaupo Komsaare, Pasi P. Aalto, Petri Keronen, and Sander Mirme Pühajärve 2007
T, Tgrad, p, RH, UV, wind, prec, W/m2, visibility What is included? Tahkuse Hyytiälä Tartu Instrument Tahkuse AIS BSMA1 BSMA2 From 23.07.2003 20.03.2003 02.04.2004 To 31.12.2006 Meteo T, Tground, p, RH, wind, W/m2 T, Tgrad, p, RH, UV, wind, prec, W/m2, visibility T, p, RH, wind, prec, lux, W/m2 Chemistry NOx,SO2,O3 Dose rate Mobility 0.00065–2.74 0.036–2.74 Size 0.5–65 nm 0.5–6.5 nm Neutral aerosols are not presented in the distribution version of the dataset
Amount of data 283 164 diurnal records, mostly 24×6 = 144 measurements per day (some variables 48 or 24 measurements per day) . Total about 40×106 values (inlc. less than 10% of missing codes). Unzipped file 146 958 411 bytes, about 3.8 bytes per value, Zipped file 52 megabytes, about 1.4 bytes per value. Format: DD (DataDiurna) diary.
Preprocessing of data BSMA (single-channel scanning) data: The records with BSMA noise index ≥ 50 were deleted as unreliable (about 15% of measurements was lost in Hyytiälä and 16% in Tartu). The measurements made with a period of 15 min were converted to the 10-minute period using interpolation by means of the DD data manager. The values of RH% recorded by BSMA were corrected so that the nighttime maximums do not exceed 100%. The fraction concentrations were converted to the values of distribution functions dn / d(log d) and dn / d(log Z) assigned to the centers of the fractions in the logarithmic scale of diameter or mobility. The indices of technical diagnostics were replaced with the integral parameters: cluster ion concentration (Z > 0.56 cm2V−1s−1), intermediate ion concentration (Z = 0.032 − 0.56 cm2V−1s−1), cluster ion average mobility and air polar conductivity.
Tahkuse AIS (multichannel instrument) data: The physically unbelievable measurements were marked as missing. The records with too many missing measurements were deleted . The measurements recorded every 5 minutes were converted to the time step of 10 minutes. If a missing value was located between known values then the gap was filled using a sophisticated interpolation method. The data was complemented with the integral parameters: cluster ion (Z > 0.56 cm2V−1s−1), intermediate ion (Z = 0.032 − 0.56 cm2V−1s−1), and large ion (Z = 0.0006 − 0.032 cm2V−1s−1) concentrations; cluster ion average mobility and air polar conductivity. The data was transformed to the dn / d(log d) and dn / d(log Z) values so that a decade is logarithmically uniformly divided into 8 fractions. Thus the first 16 fractions are just the same as in the BSMA measurements. The mobility was converted to the particle size considering the measurements of air pressure and temperature.
Included quantities Symbol Quantity T:C Air temperature near the station Tg:C Ground temperature near the station Tgr:K/m Gradient of temperature Tc:C Chimney temperature near the station RH% Relative humidity P:mb Air pressure near the station Wind:m/s Wind velocity at standard height Wmax:m/s Wind 5-minute maximum at standard height Wind:deg Wind direction Prec:mm/h Precipitation intensity Illum:lux Solar illuminance Rad:W/m2 Total insolation sun+sky UVB:W/m2 UV radiation 310-320 nm Vis:km Visibility DR:uSv/h Gamma radiation dose rate NOx:ug/m3 Concentration of NOx (1 ppb is converted to 191 ug/cm3) SO2:ug/m3 Concentration of SO2 (1 ppb is converted to 2.66 ug/cm3) O3:ug/m3 Concentration of O3 (1 ppb is converted to 2.0 ug/cm3)
Symbol Unit Quantity B-noise cm-3 BSMA noise index RC+ cm-3 Tahkuse index of spectra roughness for positive clusters RN+ cm-3 Tahkuse index of spectra roughness for positive nanoparticles RP+ cm-3 Tahkuse index of spectra roughness for positive particles RC- cm-3 Tahkuse index of spectra roughness for negative clusters RN- cm-3 Tahkuse index of spectra roughness for negative nanoparticles RP- cm-3 Tahkuse index of spectra roughness for negative particles NC+:cm-3 1/cm3 Concentration of positive cluster ions (Z > 0.5 cm2/Vs) NC-:cm-3 1/cm3 Concentration of negative cluster ions (Z > 0.5 cm2/Vs) NN+:cm-3 1/cm3 Concentration of positive nanometer ions (Z = 0.03-0.5 cm2/Vs) NN-:cm-3 1/cm3 Concentration of negative nanometer ions (Z = 0.03-0.5 cm2/Vs) NP+:cm-3 1/cm3 Concentration of positive particle ions (Z = 0.0003-0.03 cm2/Vs) NP-:cm-3 1/cm3 Concentration of negative particle ions (Z = 0.0003-0.03 cm2/Vs) ZC+cm cm2/Vs Average mobility of positive cluster ions (Z > 0.5 cm2/Vs) ZC-cm cm2/Vs Average mobility of negative cluster ions (Z > 0.5 cm2/Vs) L+:fS/m fS/m Positive conductivity of air L-:fS/m fS/m Negative conductivity of air
z+01 cm-3 dN/d(log Z) of positive ions at Z = 2.74 cm2/Vs
d+01 cm-3 dN/d(log D) of positive ions at D = 0.49 nm
STATISTICS
Station Quantity Days Min Ave Max Sigma Hyyt NC+:cm-3 1298 27 737.9 1788 225.3 Hyyt NC-:cm-3 1298 -481 721.3 2383 239.8 Hyyt NN+:cm-3 1285 -869 16.4 1032 47.1 Hyyt NN-:cm-3 1285 -804 21.5 6046 93.9 Hyyt ZC+cm 1298 0.85 1.445 2.04 0.139 Hyyt ZC-cm 1298 0.94 1.638 2.24 0.155 Tartu NC+:cm-3 750 -143 397.1 1618 175.6 Tartu NC-:cm-3 750 -164 303.4 4317 160.3 Tartu NN+:cm-3 745 -997 30.1 1933 88.9 Tartu NN-:cm-3 745 -880 36.9 7421 121 Tartu ZC+cm 750 0.02 1.434 2.59 0.086 Tartu ZC-cm 750 0.12 1.581 3.6 0.122 Tahk NC+:cm-3 1103 0 436.4 1200 161.2 Tahk NC-:cm-3 1100 0 370.3 1200 153.7 Tahk NN+:cm-3 1103 -30 114.4 1199 117.1 Tahk NN-:cm-3 1102 -30 98.5 1199 136.1 Tahk NP+:cm-3 1102 0 1371.5 8982 819 Tahk NP-:cm-3 1103 0 1389.8 8984 830.6 Tahk ZC+cm 1100 0.6 1.419 2.1 0.081 Tahk ZC-cm 1094 0.6 1.563 2.08 0.113
Station Quantity Days Min Ave Max Sigma Hyyt d-01 1298 -1565 1293.9 4952 568.7 Hyyt d-02 1298 -7720 1974.9 6060 603.4 Hyyt d-03 1298 -741 1730.7 10912 813.3 Hyyt d-04 1298 -1085 707.7 4857 628.8 Hyyt d-05 1298 -1186 93.4 6824 182.6 Hyyt d-06 1298 -816 37.0 11016 193.0 Hyyt d-07 1298 -2376 37.4 11344 207.3 Hyyt d-08 1298 -3160 28.0 7784 157.8 Hyyt d-09 1298 -4504 19.8 7560 157.7 Hyyt d-10 1298 -5568 15.6 10200 282.7 Tartu d-01 750 -384 580.9 5936 280.7 Tartu d-02 750 -72 640.1 6296 305.4 Tartu d-03 750 -640 693.5 7192 375.1 Tartu d-04 750 -672 424.9 9240 328.5 Tartu d-05 750 -1248 135.5 12776 209.7 Tartu d-06 750 -1240 46.9 16576 202.5 Tartu d-07 750 -3640 41.4 14312 198.3 Tartu d-08 750 -3904 43.3 9536 191.1 Tartu d-09 750 -6800 50.3 7360 259.2 Tartu d-10 750 -8000 51.0 14272 475.6
Station Quantity Days Min Ave Max Sigma Tahk d-01 1100 -563 507.7 2372 230.0 Tahk d-02 1100 -531 929.2 3592 382.2 Tahk d-03 1100 -448 888.5 3512 404.0 Tahk d-04 1100 -900 395.8 3919 279.4 Tahk d-05 1103 -416 187.2 10461 289.1 Tahk d-06 1102 -679 149.3 2812 188.4 Tahk d-07 1102 -503 76.4 2501 121.0 Tahk d-08 1102 -840 98.3 2489 158.6 Tahk d-09 1102 -792 152.2 3082 246.5 Tahk d-10 1102 -852 223.0 3741 371.1 Tahk d-11 1103 -991 318.5 13703 576.1 Tahk d-12 1103 -894 423.0 10357 431.1 Tahk d-13 1103 -827 650.3 11369 501.1 Tahk d-14 1103 -962 995.1 13466 677.6 Tahk d-15 1103 -955 1464.8 19793 978.2 Tahk d-16 1103 -951 2286.7 18459 1426.4 Tahk d-17 1103 -845 3111.7 28679 2021.9 Tahk d-18 1103 0 3893.4 40320 2866.8
Average size distribution of negative ions
Average size distribution of negative ions
How to use? Empty folders
time shift from UT, s (Helsinki winter time) tolerable percent of errors how many headerlines delimiter (tab) base year time pattern time shift from UT, s (Helsinki winter time) variable name in table code of a missing value station factor table/diary quantity name in diary time regime
Double- click Result
time shift (s) (Helsinki winter time) tolerable percent of errors how many headerlines delimiter (tab) base year time pattern time shift (s) (Helsinki winter time) variable name in table missing code station factor table/diary quantity name in diary time regime
Why DataDiurna? A rectangular table can be conveniently processed and analyzed with common software like MS Excel (if the number of rows < 65536). Different quantities are often saved with different time steps and many cells in a tectangular table will be empty. If a table holds a large complex dataset then it is composed mostly of the missing value codes like –999. An alternative is a set of different tables joined into a database. Common database systems, e.g. MS Access, are optimized for the business applications. The time structure and the specifity of environmental measurements is not sufficiently exploited in these systems. Another alternative is the diary format well known in hand-written measurement records. This alternative is applied in DataDiurna, which is optimized for managing of environmental measurements. An additional advantage: the rules of DataDiurna force the user to include minimum explanations into the dataset, e.g. the data cannot be saved without indicating how much the time is shifted from UT.
A source table An outlet table A source table Common diary A source table NB: DataDiurna is not a replacement for Excel 1) to manage large data: choose DataDiurna, 2) to analyze the tables: choose Statistica, Excel, etc.
Excerpt from a diary: Time regime Date Quantity Station 20041110,Tartu.m,Rad:W/m2,hm,2,2,2,2,2,2,7,20,28,32,39,33,18,8,2,2,2,2,2,2,2,2,2,2 20041110,Tartu.m,Wind:m/s,hm,3.8,3.8,3.9,4.4,4.9,4.9,4.4,2.4,1.8,1.8,2.5,2.1,2,1.9,0.7,1.7,2,1.5,2,1,1.2,1.1,1.3,1.1 20041111,Hyyt,L+:fS/m,m10,9.6,9.1,8.7,8.9,9.1,9.4,10.2,9.8,9.6,10.3,10.9,11.4,11.6,12.2,12.5,11.8,11.9,12.2,12.7,12.5,12.1,11.9,11.3,11,11.6,11.5,11.3,10.9,11.2,11.5,11.5,11.3,11,10.7,11.1,11.5,11.1,11,,,,10.1,10.1,10,9.9,10.1,10.2,10.2,10,10.3,10.8,11.4,11.2,11.1,11.4,11.4,11.3,11.2,11.7,12,11.5,12.2,12.9,12.7,12.9,13.1,13.1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, 20041111,Hyyt,Rad:W/m2,m30,0,0,0,0,0,0,0,0,0,0,0,0,1,1,2,3,5,4,7,15,24,23,24,28,15,10,3,1,0,0,0,0,0,0,-1,0,0,0,0,0,0,-1,-1,0,0,0,0,0 Time regime Date Quantity Station Here are missing data
DataDiurna is omnivorous: The time stamp and the numbers can be written as you data provider likes, e.g. following data rows, where the time stamp is highlighted, are considered equal Time 13:25 Date 21.05.2005 23.1,-17,?,1.6e-19,2783597+003 2005-05-21 13:25:00,23.1,-17,x,1.60-19,0.2783597e3 DOY=141.559, measurements: +23.10 -17 -999 278.3597 20050521132500) 23.1;-17;-999;2,783,597e-4 The time is automatically converted into UT when importing a source table to the diary and to any time zone when exporting an output table from the diary. The rows of the source table may be non-uniformly distributed in time: DataDiurna can interpolate and average the data during the import and export. The code of missing value may be individual for every variable and may be numerical (e.g. –999) or non-numerical (e.g. ?). etc... etc... etc... ....... Warning: The data manager DD2007T.exe is still a beta-version.
The basic version of the dataset FINESTION-2003-2006 was composed from a large number of source tables with different time steps from 10 minutes to 1 hour
The working version of the dataset FINESTION-2003-2006 is located in the computer of the user Thus: Every user can create extended personal version of the dataset importing additional private data
NB: You should have 200 MB free on your hard disk. How to get? The DataDiurna data manager is free for unlimited distribution. The dataset ATMEL2007A is free for unlimited distribution. Both they can be retrieved from the page: http://ael.physic.ut.ee/tammet/dd/ The dataset FINESTION-2003-2006 is free but available only for our research group. It is not published in Internet today. Thus: Ask the memory stick or the CD with the dataset and copy the folder DD into your own computer. NB: You should have 200 MB free on your hard disk.
Thank you for choosing DataDiurna Thank you for choosing DataDiurna .....hard to learn but easy to use when managing large complex datasets like FINESTION-2003-2006