Description: data on wheat yields taken from a 20×25 grid of plots in Great Knott field at Rothamsted in the summer 1910.
Ozone data (Chapter 5)
Description: 8 hour average daily ozone in parts per billion (PPB) for 513 sites in the US over the ozone "season" which runs from April through October (184 days) for five years (1995 - 1999). See here for detailed description and a preliminary analysis.
Source:
see Doug Nychka's webpage
Dataset: ozmax8.dat and ozmax8.info.q.
Nyssa data (Chapter 5)
Description: Location of male, female, and juvenile tupelo trees in 3 plots in the Savannah River, Barnwell County, South Carolina. (Legend: plot = number of the plot, sex = sex of the tree , x, y = coordinates).
Source: Shea,
M.M., Dixon, P.M. and Sharitz R.R. (1993), Size differences, sex ratio,
and spatial distribution of male and female water tupelo, Nyssa
aquatica (Nyssaceae), American
Journal of Botany, 80,
26--30. (Courtesy Philip Dixon).
Dataset: tupelo.
Cancer poumon data (Chapter 5)
Description: Male lung cancer
mortality rate standardized over the age range 35-74 and
over 2 year period, 1968-1969 at the
scale of the French départments. Covariates are pecentage of employed males in
specific types of industry ( metal, general engineering, textile),
cigarette sales.
Source:
Richarson, S., Guihenneuc, C. and Lasserre, V. (1992), Spatial linear
models with autocorrelated error structure, The Statistician, 41, 539-557. (Courtesy Chantal
Guihenneuc-Jouyaux).
Dataset: cancer, cancer-voisinage (incidence
matrix)
Nebrodi data (Chapter 5)
Description: remote sensed image from Landsat TM
(Thematic Mapper) of the Nebrodi Mountains. The image covers a surface
of 2016 km^2 and is composed by 2240 (40 vertical, 56 horizontal)
pixels
pixels, each one representing a ground surface of 900 m^2. Black pixel
(1) corresponds to a farmed area, white pixel (0) corresponds
to non-farmed area. The covariates are soil
characteristics: soil density, measured in grammes per cm^3 (SD);
horizon thickness, in centimeters (HT); mean yearly rainfall in
centimeters (MYR); permeation capacity in saturation conditions, in
millimeters per day (PC); digital terrain model, in meters (DTM).
Source: Alfò, M. and Postiglione, P. (2002), Semiparametric modelling of spatial binary observations, Statistical Modelling, 2, 123-137. (Courtesy Marco Alfò). See also the Archives section at the Statistical Modelling Society.
Dataset: nebrosdata.
Tuscany cancer data (Chapter 5)
Description:
Lung cancer death
certificates for males resident in the 287 municipalities of the
Tuscany region (Italy) from 1971 to 1999. Covariate is the
education score which is derived from censuses of the period 1921–91
and is defined as prevalence
of less educated people and was measured on a relative scale, defining
a different threshold for ‘low
educational level’ at each census.
Source: Catelan, D., Biggeri, A., Dreassi, E. and Lagazio, C. [2006], Space-cohort Bayesian models in ecological studies, Statistical Modelling, 6, 159-173. (Courtesy Emanuela Dreassi and Corrado Lagazio). See also the Archives section at the Statistical Modelling Society.
Dataset: observed_lung_cancer, expected_lung_cancer, educational_index
Lavender data (Chapter 5)
Description: Decline of lavender plants due to a phytoplasm in a plot of 11 rows. Each row contains 80 plants separated by 20 cm. The severity of the decline is on a scale from 0 to 5.
Source: Peyrard,
N., Calonnec, A., Bonnot, F. Chadoeuf, J. (2005), Explorer un jeu de
données sur grille par test de permutation, Revue de Statistique
Appliquée, LIII, 59-78.
(Courtesy
Joel Chadoeuf).
Dataset: lavender.
Retention potential data (Chapter 5)
Description: Retention potential data at 3 different
pressure (W5, W200, W1500) . Covariates are granulometric
fractions of clay and silt with 4 different classes from the
coarsest to the finest (LF, F3, F4.5, F6.7)
Source: Courtesy
Michel Goulard.
Dataset: potentials.
Castanea data (Chapter 5)
Description: distribution of about 180 plants of
Castanea pumila (specie 38) in the state of Florida, and data on the
values of four climate variables which were expected to be important
factors in determining the distribution of the plant species (FZF =
median freeze-free period in days,TAV = mean annual temperature in
degrees Celsius, PRCP = mean total annual precipitation in millimetres, MI = moisture index =
(PRCP)/(TAV x 58.93), where TAV x 58.93 = estimate of mean annual
potential evapotranspiration by the Holdridge method)
Source: Wu, H. and Huffer, F.W.
(1997), Modelling the Distribution of Plant Species Using Autologistic
Regression Model, Environmental and Ecological Statistics, 4, 49-64. See
Fred Huffer's page
Dataset: sp38data.
Indigo data (Chapter 5)
Description: Spatial distribution of the recorded
birds belonging to the Indigo bunting family. A total of 624 points in
the Southern Piedmont region (USA)
were
surveyed between 1 June and 17 July in years 2002. Five land use
predictors ( measured within 1 km around the count points) are
considered: natural woodland, denoted by MK, open parks, hayfields, pasture, denoted
by GK, early successional forests, denoted by TK,
impervious surfaces (roads, housing), denoted by UK and pine
plantations, denoted by PK.
Source: Strathford,
J.A and Robinson, W.D. (2005). Distribution of neotropical
migratory bird species across
an urbanizing landscape. Urban Ecosystems, 8, 59--77. (Courtesy
Jeffrey Strathford).
Dataset: indigo , coords-indigo (coordinates).