R/Generate_DataFile_MG.R
Generate_DataFile_MG-deprecated.Rd
This function is used to generate, from the BIN file(s), a list of values of:
Multi-grain OSL intensities and associated uncertainties, regenerative doses, etc., which will be the input of the Bayesian models.
To be easy-to-use, this function requires a rigorous organisation - all needed files should be arranged in one folder -
of informations concerning each BIN file.
It is possible to process data for various samples simultaneously and to consider more
than one BIN-file per sample.
character (required): the path to the project folder, containing one or more sub folders in which the BIN files
are located. If it is not equal to ""
, it must end with "/"
.
character (required) vector: list of names of the sub-folders containing the BIN files
each sub folder must contain a BIN file and associated csv files.
See details for more informations on associated csv files required in the sub folders.
If there is more than one BIN file per sample, see the details section for instructions regarding how to correctly fill the
FolderNames
vector.
integer (required): number of samples
integer (with default): number of BIN files. It must be equal to, or greater than Nb_sample
.
integer vector (with default): vector with the number of BIN files per sample.
The length of this vector must be equal to Nb_sample
and the sum of entries of this vector must be equal to Nb_binfile
.
If there is more than one BIN file per sample, see the details section for instructions regarding how to correctly fill BinPerSample
vector.
Otherwise, this vector must contain a list of 1 values.
character (with default): column separator in the DiscPose.csv
files.
character (with default): column separator in the DoseEnv.csv
files.
character (with default): column separator in the DoseLab.csv
files.
character (with default): column separator in the Rule.csv
files.
logical (with default): enable/disable verbose mode
logical (with default): if set to TRUE
, the order of the records is
pushed to follow the one "Run 1 at a time" order (this is, all sequence steps were performed on
one aliquot before moving to the next aliquot), regardless of their original sequence.
The default is FALSE
because 'BayLum'
assumes that the sample was measured with the
"Run 1 at a time" option (only Risø readers, lexsyg readers do not have another option).
In other words, the argument allows you to automatically correct your input data to follow the order 'BayLum'
expects.
Why isn't the default value TRUE
?. Because this re-ordering must fail
if a measurement position was used more than once for different samples!
This typically happens when different BIN/BINX files are merged.
further arguments that can be passed to Luminescence::read_BIN2R.
A list containing the following objects:
LT (one list per sample); each list contains all L/T values for the corresponding sample;
sLT (one list per sample); each list contains all uncertainties on L/T values for the corresponding sample;
ITimes (one list per sample); each list contains irradiation time values for the corresponding sample;
dLab, a matrix containing in line i
, the laboratory dose rate and its variance for sample i
;
ddot_env, a matrix containing in line i
, the environmental dose rate and its variance (excluding the common error terms) for sample i
;
regDose (one list per sample); each list contains all regenerated doses;
J, a vector giving, for each BIN file, the number of aliquots selected for the analysis;
K, a vector giving, for each BIN file, the number of regenerative doses in the SAR protocol;
Nb_measurement, a vector giving, for each BIN file, the number of measurements;
** How to save this list **
You can save this list in a .RData object. To do this, you can use the function save
.
Then, to load this list you can use the function load
(see example section fore more details).
With Path
and FolderNames
, this function goes to the sub folders containing the BIN files and associated information to compute
the luminescence data.
** What are the required files in each subfolder? **
Each subfolder can be named, for example, as the sample name followed by a number; it must contain:
bin.bin, the bin file renamed as bin.BIN (note: the name of all files matters);
Disc.csv, a one columns csv file containing the list of disc number of the previously selected grains (typically this list will include the position of grains based on their sensitivity, recycling or other properties);
DoseEnv.csv, a two columns file containing the observation of the natural (or environmental), dose rate, and its non-shared variance (i.e. after removing all shared errors), both in Gy. Note: the user shall provide the squared value of the error associated with the dose rate experienced by the sample grains in nature;
DoseSourve.csv, a two columns file containing the observation of the laboratory dose rate, and its variance (squared error), both in Gy;
rule.csv, a csv file containing information on
beginSignal=
the first channel for summing the natural or regenerative OSL signal (typically 1 or 6);
endSignal=
the last channel for summing the natural or regenerative OSL signal (typically 5 or 10);
beginBackground=
the first channel for background estimation of the natural or regenerative OSL signal (typically 76 or 81);
endBackground=
the last channel for background estimation of the natural or regenerative OSL signal (typically 95 or 100);
beginTest
,
endTest
,
beginTestBackground
,
endTestBackground=
same values as above, for the test dose response (typically the same values should be used);
inflatePercent=
uncertainty arising from the instrument reproducibility (typically 0.02, i.e. 2\
nbOfLastCycleToRemove=
number of cycles at the end of the SAR protocol which should not be included in the dose response curve fitting
(typically 1 if only a recycling test is performed, or 2 if both recycling and IR depletion are tested).
** How to fill the FolderNames
vector? **
FolderNames
is a vector of length Nb_binfile
. FolderNames[i]
is the name (e.g., Sample1-File1, or successive names separated by "/" signs,
if BIN files are in subfolders, e.g. Sample1/File1) of the subfolder containing all informations on the BIN file of ID number i
.
The names in FolderNames
are ordered following two rules:
The names in the FolderNames
vector must be ordered following the sample order
(the names of subfolders containing BIN files for the same sample should follow each other in the FolderNames
vector, e.g.
Sample1, Sample2-File1, Sample2-File2, etc.).
If stratigraphic constraints apply to samples, and so a Bayesian model with stratigraphic constraints is implemented,
then the names in the FolderNames
vector must be ordered by order of increasing ages.
For example, FolderNames=c(noun1,noun2)
, in which case noun1
(respectively, noun2
) corresponds to the subfolder
name containing the BIN file of sample 1 (respectively of sample 2).
In addition, if we know that sample 1 is younger than sample 2, then FolderNames
vector is correctly filled.
If conversely, FolderNames=c(noun2,noun1)
, the analysis performed by AgeS_Computation
would not be consistent.
** How to fill the BinPerSample
vector? **
BinPerSample[i]
corresponds to the number of BIN files for the sample whose number ID is equal to i
.
For example, let us consider a case with two samples (Sample1 and Sample2), with 2 BIN files for Sample1 and 1 for Sample2.
In this case, Nb_binfile
=3 and Nb_sample
=2.
The user may then set FolderNames=c("Sample1-File1", "Sample1-File2", "Sample2-File1")
, in which case "Sample1-1"
is the name of the subfolder containing the first BIN file
for Sample1, "Sample1-File2"
the name of the subfolder for the second BIN file of Sample1; eventually, "Sample2-1"
is the name of the subfolder containing the BIN file
for the second sample. In this case, BinPerSample=c(2,1)
.
For the general BIN-file structure, the reader is referred to the following website: http://www.nutech.dtu.dk/
The function Luminescence::read_BIN2R is used to read the BIN files.
The function imports only BIN/BINX-file records which have been previously selected.
Christophe, C., Kreutzer, S., Philippe, A., Guérin, G., 2024. Generate_DataFile_MG-deprecated(): Generates, from one (or several) BIN file(s) of Multi-grain OSL measurements a list of luminescence data and information before statistical analysis (DEPRECATED). In: Christophe, C., Philippe, A., Kreutzer, S., Guérin, G., Baumgarten, F.H., Frerebeau, N., 2024. BayLum: Chronological Bayesian Models Integrating Optically Stimulated. R package version 0.3.2. https://CRAN.r-project.org/package=BayLum
if (FALSE) {
path <- system.file("extdata/FER1", "", package="BayLum")
folder <- ""
# give the number of sample
nbsample <- 1
DATA <- Generate_DataFile_MG(
Path = path,
FolderNames = folder,
Nb_sample = nbsample)
str(DATA)
# to save information in RData object in folder containing bin file
#save(DATA,file=c(paste(path,folder,'DATA.RData',sep="")))
# to load information containing DATA.RData object
#load(file=c(paste(path,folder,"DATA.RData",sep="")))
}