Roche
diff --git a/CITATION.cff b/CITATION.cff
Lines changed: 1 addition & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 12 additions & 6 deletions
diff --git a/change_log.md b/change_log.md
Lines changed: 7 additions & 0 deletions
diff --git a/docs/_build/doctrees/environment.pickle b/docs/_build/doctrees/environment.pickle
123 Bytes
diff --git a/docs/_build/doctrees/index.doctree b/docs/_build/doctrees/index.doctree
9.64 KB
diff --git a/docs/_build/html/.buildinfo b/docs/_build/html/.buildinfo
Lines changed: 1 addition & 1 deletion
diff --git a/docs/_build/html/_static/documentation_options.js b/docs/_build/html/_static/documentation_options.js
Lines changed: 1 addition & 1 deletion
diff --git a/docs/_build/html/genindex.html b/docs/_build/html/genindex.html
Lines changed: 2 additions & 2 deletions
@@ -5,7 +5,7 @@ authors:
   given-names: "Otto"
   orcid: "https://orcid.org/0000-0002-3363-9287"
 title: "Pyreadstat"
-version: 1.2.8
+version: 1.2.9
 doi: 10.5281/zenodo.6612282
 date-released: 2018-09-24
 url: "https://github.com/Roche/pyreadstat"
@@ -333,7 +333,8 @@ df, meta = pyreadstat.read_sas7bdat('/path/to/a/file.sas7bdat', usecols=["variab
 A challenge when reading large files is the time consumed in the operation. In order to alleviate this
 pyreadstat provides a function "read\_file\_multiprocessing" to read a file in parallel processes using
  the python multiprocessing library. As it reads the whole file in one go you need to have enough RAM for the operation. If
-that is not the case look at Reading rows in chunks (next section)
+that is not the case look at Reading rows in chunks (next section). Note, however, that you can combine reading in parallel
+with reading in chunks, as described in the next section.
 
 Speed ups in the process will depend on a number of factors such as number of processes available, RAM, 
 content of the file etc.
@@ -598,11 +599,17 @@ translated as NaN by default and to the correspoding string value if
 user_missing is set to True. meta.missing_ranges will show the string
 value as well.
 
-If the value in
+When writing a pandas dataframe to a sav file, if user defined missing values are not set, NaNs are translated to
+empty strings, as there is no other possibility to represent those missing values and user defined missing values
+are not set automatically.
+
+When reading a sav into a pandas dataframe, if the value in
 a character variable is an empty string (''), it will not be translated to NaN, but will stay as an empty string. This
-is because the empty string is a valid character value in SPSS and pyreadstat preserves that property. You can convert
+is because the empty string is a valid character value in SPSS and pyreadstat preserves that property.
+
+This behaviour creates an asymmetrical situation that has to be managed by the user. You can convert
 empty strings to NaN very easily with pandas if you think it is appropriate
-for your dataset.
+for your dataset, or you can use defined missing values as described before.
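
As a minimal sketch of the pandas conversion mentioned above: the dataframe below is hypothetical data standing in for the result of reading a sav file, and the column names and the `char_cols` list are assumptions made only for illustration. It replaces empty strings with NaN in the character columns, and shows the reverse step you may want before writing back.

```python
import numpy as np
import pandas as pd

# Hypothetical data standing in for a dataframe read from a sav file:
# the character column keeps '' while the numeric column already has NaN.
df = pd.DataFrame({
    "name": ["Ann", "", "Bob"],
    "score": [1.5, np.nan, 3.0],
})

# Character columns to treat (listed by hand here, an assumption of this sketch).
char_cols = ["name"]

# Convert empty strings to NaN only in the character columns.
df[char_cols] = df[char_cols].replace("", np.nan)

# Reverse step: turn NaN back into empty strings, e.g. before writing.
restored = df[char_cols].fillna("")
```

Listing the character columns explicitly keeps numeric columns untouched and avoids depending on how a given pandas version infers string dtypes.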
 
 
 ##### SAS and STATA
@@ -641,7 +648,6 @@ df, meta = pyreadstat.read_dta("/path/to/file.dta", user_missing=True, apply_val
 
 Empty strings are still translated as empty strings and not as NaN.
 
-
 The information about what values are user missing is stored in the meta object, in the variable missing_user_values.
 This is a list listing all user defined missing values.
 
@@ -798,7 +804,7 @@ pyreadstat.write_sav(df, path, variable_format=formats)
 ```
 
 The appropriate formats to use are beyond the scope of this documentation. Probably you want to read a file
-produced in the original application and use meta.original_value_formats to get the formats. Otherwise look
+produced in the original application and use meta.original_variable\_types to get the formats. Otherwise look
 for the documentation of the original application.
 
 ##### SPSS
 
@@ -1,3 +1,10 @@
+# 1.2.9 (github, pypi and conda 2025.05.17)
+* Better error reporting when writing a column with an empty name, solves #276
+* Added extra_time_formats, solves #283
+* Changed empty original_variable_type from 'NULL' to None, solves #287
+* Implemented string_ref for large strings when writing DTA, solves #268
+* Updated ReadStat sources to commit a000e9c88fee1a003a60b3a86ef5a0ed2b38e56e (March 24th 2025)
+
 # 1.2.8 (github, pypi and conda 2024.10.18)
 * Added Multiple Response Data Sets for SAV files #259
 * Fixed pyreadstat not raising error if folder does not exist when writing #269
 
@@ -1,4 +1,4 @@
 # Sphinx build info version 1
 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
-config: 3684e396bb8bf7df6bbcdabbae81c680
+config: 06953f0822da5e417a66bd7a23c666a3
 tags: 645f666f9bcd5a90fca523b33c5a78b7
@@ -1,5 +1,5 @@
 const DOCUMENTATION_OPTIONS = {
-    VERSION: '1.2.8',
+    VERSION: '1.2.9',
     LANGUAGE: 'en',
     COLLAPSE_INDEX: false,
     BUILDER: 'html',
 
@@ -3,7 +3,7 @@
 <head>
   <meta charset="utf-8" />
   <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-  <title>Index &mdash; pyreadstat 1.2.8 documentation</title>
+  <title>Index &mdash; pyreadstat 1.2.9 documentation</title>
       <link rel="stylesheet" type="text/css" href="_static/pygments.css?v=fa44fd50" />
       <link rel="stylesheet" type="text/css" href="_static/css/theme.css?v=19f00094" />
 
@@ -12,7 +12,7 @@
     <script src="_static/js/html5shiv.min.js"></script>
   <![endif]-->
 
-        <script src="_static/documentation_options.js?v=4d6f9085"></script>
+        <script src="_static/documentation_options.js?v=e917149c"></script>
         <script src="_static/doctools.js?v=9a2dae69"></script>
         <script src="_static/sphinx_highlight.js?v=dc90522c"></script>
     <script src="_static/js/theme.js"></script>