# Use Case 09: Preparation of datasets for `CRTspat`

`Usecase9.Rmd`

The starting point for the analyses is the co-ordinates for the
spatial units that are to be randomized. These can be provided as
lat-long coordinates, or as Cartesian point co-ordinates. Geolocated
data from completed field studies or trials in progress can be read into
`CRTspat`

in the form of data frames, preferably with one
record for each location in the trial. Pre-existing data from the field
should be coded as follows:

Description of variable | Variable name | Type and values |
---|---|---|

x-coordinates of locations | x | Numeric |

x-coordinates of locations | y | Numeric |

cluster assignment | cluster | factor |

arm assignment | arm | factor with levels “Control” and “Intervention” |

buffer assignment | buffer | logical (TRUE for locations in the buffer) |

Users might want to include an ID variable in the data frame if they intend to link the outputs back to other datasets. Other variables, including baseline data or trial outcomes may also be included.

Description of variable | Default Variable name | Type |
---|---|---|

Baseline numerator | base_num | Numeric |

Baseline denominator | base_denom | Numeric |

Numerator | num | Numeric |

Denominator | denom | Numeric |

Simulated co-ordinate sets can also be generated *de novo*
(function `CRTsp()`

)
for use in methods development and testing.

To make it possible to share data files without compromising
confidentiality the `anonymize_site()`

function is provided. This removes absolute geolocations and applies a
transformation to the coordinates to conserve distances between them but
modifying the orientation.

The `latlong_as_xy`

function is available to convert co-ordinates provided as decimal
degrees into Cartesian co-ordinates with units of km with centroid
(0,0). If the input co-ordinates are provided using a different
projection then they must be converted externally to the package.