You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
this is true if the table being summarized contains patient-level data
292
+
*and* the summarizing function is reporting on every value in one or more columns
293
+
*and* rare values are not being suppressed (leading to a value that applies
294
+
to just one or two individuals being released).
295
+
Search the ``src-stats.yaml`` file for comments such as:
296
+
297
+
All the values that appear in column *column-name* of table *table-name*
298
+
299
+
or
300
+
301
+
All the values that appear in column *column-name* of table *table-name* more than 7 times
302
+
303
+
Secondly, Vocabulary Tables' privacy risk is "None if correctly identified".
304
+
A Vocabulary Table is supposed to be a table simply providing categories for other tables to reference.
305
+
They are not changed during the operation of the database and so releasing them does not represent a privacy risk.
306
+
However, there is some flexibility here; a list of care provider institutions is not technically a vocabulary table
307
+
but it is probably safe to treat it as one.
308
+
The important point is that Datafaker allows the user to specify any table as a vocabulary table;
309
+
if the user incorrectly specifies sensitive data as Vocabulary, it must not be released!
244
310
245
311
Datafaker Repopulate phase
246
312
--------------------------
247
313
314
+
Once we have released the summary data as described above we can operate outside of the TRE
315
+
as the sensitive data is no longer accessed by Datafaker.
316
+
317
+
The remaining commands are:
318
+
248
319
- ``datafaker create-tables`` creates the structure of the destination database to match (as much as is requested) the structure of the source database
249
320
- ``datafaker create-generators`` creates Python code files that will actually generate the data (this phase might be removed in a future version of Datafaker)
250
321
- ``datafaker create-data`` writes fake data into the destination database.
0 commit comments