This function is used to generate, from the BIN file(s), a list of values of: Single-grain OSL intensities and associated uncertainties, regenerative doses, etc., which will be the input of the Bayesian models. To be easy-to-use, this function requires a rigorous organisation - all needed files should be arranged in one folder - of informations concerning each BIN file.
It is possible to process data for various samples simultaneously and to consider more than one BIN file per sample.

Generate_DataFile(
  Path,
  FolderNames,
  Nb_sample,
  Nb_binfile = length(FolderNames),
  BinPerSample = rep(1, Nb_sample),
  sepDP = c(","),
  sepDE = c(","),
  sepDS = c(","),
  sepR = c("="),
  verbose = TRUE,
  ...
)

Arguments

Path

character (required): the path to the project folder, containing one or more sub folders in which the BIN files are located. If it is not equal to "", it must be terminated by "/".

FolderNames

character (required): list of names of the sub-folders containing the BIN files

  • each sub folder must contain a BIN file and associated csv files. See details for more informations on associated csv files required in the sub folders. If there is more than one BIN file per sample, see the details section for instructions regarding how to correctly fill the FolderNames vector.

Nb_sample

integer (required): number of samples.

Nb_binfile

integer (with default): number of BIN files. It must be equal to, or greater than Nb_sample.

BinPerSample

integer vector (with default): vector with the number of BIN files per sample. The length of this vector must be equal to Nb_sample and the sum of entries of this vector must be equal to Nb_binfile. If there is more than one BIN file per sample, see the details section for instructions regarding how to correctly fill BinPerSample vector. Otherwise, this vector must contain a list of 1 values.

sepDP

character (with default): column separator in the DiscPose.csv files.

sepDE

character (with default): column separator in the DoseEnv.csv files.

sepDS

character (with default): column separator in the `DoseLab.csv`` files.

sepR

character (with default): column separator in the Rule.csv files.

verbose

logical (with default): enable/disable verbose mode

...

further arguments that can be passed to Luminescence::read_BIN2R.

Value

A list containing the following objects:

  • LT (one list per sample); each list contains all L/T values for the corresponding sample;

  • sLT (one list per sample); each list contains all uncertainties on L/T values for the corresponding sample;

  • ITimes (one list per sample); each list contains irradiation time values for the corresponding sample;

  • dLab, a matrix containing in line i, the laboratory dose rate and its variance for sample i;

  • ddot_env, a matrix containing in line i, the environmental dose rate and its variance (excluding the common error terms) for sample i;

  • regDose (one list per sample); each list contains all regenerated doses;

  • J, a vector giving, for each BIN file, the number of aliquots selected for the analysis;

  • K, a vector giving, for each BIN file, the number of regenerative doses in the SAR protocol;

  • Nb_measurement, a vector giving, for each BIN file, the number of measurements.

** How to save this list **

You can save this list in a .RData object. To do this, you can use the fonction save. Then, to load this list you can use the function load (see example section fore more details).

Details

With Path and FolderNames, this function goes to the sub folders containing the BIN files and associated information to compute the luminescence data.

** What are the required files in each subfolder? **

Each sub folder can be named, for example, as the sample name followed by a number; it must contain:

  • bin.bin: the bin file renamed as bin.BIN (note: the name of all files matters);

  • DiscPos.csv: a two columns csv file containing the list of disc and grain position number of the previously selected grains (typically this list will include the position of grains based on their sensitivity, recycling or other properties);

  • DoseEnv.csv: a two columns file containing the observation of the natural (or environmental), dose rate, and its non-shared variance (i.e. after removing all shared errors), both in Gy. Note: the user shall provide the squared value of the error associated with the dose rate experienced by the sample grains in nature;

  • DoseSourve.csv: a two columns file containing the observation of the laboratory dose rate, and its variance (squared error) both in Gy;

  • rule.csv: a csv file containing information on

    • beginSignal= the first channel for summing the natural or regenerative OSL signal (typically 1 or 6);

    • endSignal= the last channel for summing the natural or regenerative OSL signal (typically 5 or 10);

    • beginBackground= the first channel for background estimation of the natural or regenerative OSL signal (typically 76 or 81);

    • endBackground= the last channel for background estimation of the natural or regenerative OSL signal (typically 95 or 100);

    • beginTest=,

    • endTest=,

    • beginTestBackground=,

    • endTestBackground= same values as above, for the test dose response (typically the same values should be used);

    • inflatePercent= uncertainty arising from the instrument reproducibility (typically 0.02, i.e. 2\

    • nbOfLastCycleToRemove= number of cycles at the end of the SAR protocol which should not be included in the dose response curve fitting (typically 1 if only a recycling test is performed, or 2 if both recycling and IR depletion are tested).

** How to fill the FolderNames vector? **

FolderNames is a vector of length Nb_binfile. FolderNames[i] is the name (e.g., Sample1-File1, or successive names separated by "/" signs, if BIN files are in subfolders, e.g. Sample1/File1) of the subfolder containing all informations on the BIN file of ID number i. The names in FolderNames are ordered following two rules:

  • The names in the FolderNames vector must be ordered following the sample order (the names of subfolders containing BIN files for the same sample should follow each other in the FolderNames vector, e.g. Sample1, Sample2-File1, Sample2-File2, etc.).

  • If stratigraphic constraints apply to samples, and so a Bayesian model with stratigraphic constraints is implemented, then the names in the FolderNames vector must be ordered by order of increasing ages.
    For example, FolderNames=c(noun1,noun2), in which case noun1 (respectively, noun2) corresponds to the subfolder name containing the BIN file of sample 1 (respectively of sample 2). In addition, if we know that sample 1 is younger than sample 2, then FolderNames vector is correctly filled.
    If conversely, FolderNames=c(noun2,noun1), the analysis performed by AgeS_Computation would not be consistent.

** How to fill the BinPerSample vector? **

BinPerSample[i] correponds to the number of BIN files for the sample whose number ID is equal to i.
For example, let us consider a case with two samples (Sample1 and Sample2), with 2 BIN files for Sample1 and 1 for Sample2. In this case, Nb_binfile=3 and Nb_sample=2. The user may then set FolderNames=c("Sample1-File1", "Sample1-File2", "Sample2-File1"), in which case "Sample1-File1" is the name of the subfolder containing the first BIN file for Sample1, "Sample1-File2" the name of the subfolder for the second BIN file of Sample1; eventually, "Sample2-File1" is the name of the subfolder containing the BIN file for the second sample. In this case, BinPerSample=c(2,1).

For the general BIN-file structure, the reader is referred to the following website: http://www.nutech.dtu.dk/

The function read_BIN2R developped in Luminescence package is used to read the BIN files.

Author

Claire Christophe, Sebastian Kreutzer, Anne Philippe, Guillaume Guerin

How to cite

Christophe, C., Kreutzer, S., Philippe, A., 2024. Generate_DataFile-deprecated(): Generates, from one (or several) BIN-file(s) of Single-grain OSL measurements, a list of luminescence data and information before statistical analysis (DEPRECATED). In: Christophe, C., Philippe, A., Kreutzer, S., Guérin, G., Baumgarten, F.H., Frerebeau, N., 2024. BayLum: Chronological Bayesian Models Integrating Optically Stimulated. R package version 0.3.2. https://CRAN.r-project.org/package=BayLum

Examples


if (FALSE) {
## Example for one sample with one Bin File
path<- system.file("extdata/samp1", "", package="BayLum")
folder=""
nbsample=1  # give the number of sample
Data <- Generate_DataFile(
 Path = path,
 FolderNames = folder,
 Nb_sample = nbsample,
 verbose = FALSE)
str(Data)

## to save information in RData object in folder containing bin file
# save(Data,file=c(paste(path,folder,'Data.RData',sep="")))
## to load information containing Data.RData object
# load(file=c(paste(path,folder,"Data.RData",sep="")))
}