From 4cba7672fefff7d995e6ff474640a74f912d30eb Mon Sep 17 00:00:00 2001
From: Shay Palachy <shaypal5@users.noreply.github.com>
Date: Fri, 29 May 2026 13:26:25 +0300
Subject: [PATCH] fix(tests): add author profile URLs to HF preview allowlist +
 regen sample

The attribution links added to release/README.md (shaypalachy.com,
huggingface.co/shaypal5, kaggle.com/derelictpanda, github.com/shaypalachy)
were not in _LINK_OK_PREFIXES, causing test_public_rendered_links_point_at_known_targets
to fail.  Also regenerate the committed preview sample to match the
updated README.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 .../huggingface_public.html                   | 25 +++++++++++--------
 tests/scripts/test_preview_hf_page.py         |  5 ++++
 2 files changed, 19 insertions(+), 11 deletions(-)
diff --git a/release/_preview_committed/huggingface_public.html b/release/_preview_committed/huggingface_public.html
index 0bd8e75..757bdd7 100644
--- a/release/_preview_committed/huggingface_public.html
+++ b/release/_preview_committed/huggingface_public.html
@@ -167,7 +167,8 @@ <h2 class="section__heading">Configurations / Subsets <span class="section__coun
 <section class="readme">
 <h1>LeadForge: Synthetic B2B Lead Scoring Dataset (<code>leadforge-lead-scoring-v1</code>)</h1>
 <p>A relational, reproducible, three-tier synthetic CRM dataset family for
-teaching lead scoring at scale. Generated by
+teaching lead scoring at scale. Created by
+<a href="https://www.shaypalachy.com/">Shay Palachy Affek</a> and generated by
 <a href="https://github.com/leadforge-dev/leadforge">leadforge</a>, an
 open-source Python framework for synthetic CRM/funnel data. The
 framework version is decoupled from the dataset version: the package
@@ -175,16 +176,14 @@ <h1>LeadForge: Synthetic B2B Lead Scoring Dataset (<code>leadforge-lead-scoring-
 tag.</p>
 <h2>Why lead scoring matters in 2024–2026</h2>
 <p>Mid-market SaaS vendors entered 2024–2026 with growth slowing and
-customer-acquisition costs rising[^macro], so predicting <em>which</em> leads
-convert within a fixed window has moved from a marketing nicety to a
-survival skill. This dataset teaches that skill on a relational
-substrate, with the realistic confusions (snapshot-window discipline,
-leakage traps, channel signal weaker than vendor blogs imply) that
-students will hit when they finally get hands on real CRM data.</p>
-<p>[^macro]: Macroeconomic framing summarised in
-<a href="https://github.com/leadforge-dev/leadforge/blob/main/docs/external_review/summaries/gemini_v2_summary.md"><code>docs/external_review/summaries/gemini_v2_summary.md</code></a>
-(median public-SaaS growth 30%→25% from 2023 to 2025; New CAC Ratio
-rose materially in 2024).</p>
+customer-acquisition costs rising (median public-SaaS growth 30%→25%
+from 2023 to 2025; New CAC Ratio rose materially in 2024), so
+predicting <em>which</em> leads convert within a fixed window has moved from
+a marketing nicety to a survival skill. This dataset teaches that
+skill on a relational substrate, with the realistic confusions
+(snapshot-window discipline, leakage traps, channel signal weaker than
+vendor blogs imply) that students will hit when they finally get hands
+on real CRM data.</p>
 <h2>What's inside</h2>
 <pre><code>.
 ├── intro/ intermediate/ advanced/    # student_public bundles, one per difficulty tier
@@ -601,6 +600,10 @@ <h2>Maintenance, adversarial framing, license</h2>
 </table>
 <p>Verify integrity with <code>leadforge validate &lt;bundle_dir&gt;</code>; every file
 is hashed in <code>manifest.json</code>.</p>
+<h2>Credits</h2>
+<p>Created by <a href="https://www.shaypalachy.com/">Shay Palachy Affek</a>.
+Dataset generated with <a href="https://github.com/leadforge-dev/leadforge">leadforge</a> (MIT).
+Profiles: <a href="https://huggingface.co/shaypal5">HuggingFace</a> · <a href="https://www.kaggle.com/derelictpanda">Kaggle</a> · <a href="https://github.com/shaypalachy">GitHub</a></p>
 </section>
 <footer class="dataset-footer">
   <div class="dataset-footer__license">License: mit</div>
diff --git a/tests/scripts/test_preview_hf_page.py b/tests/scripts/test_preview_hf_page.py
index 4f7301a..c0081c8 100644
--- a/tests/scripts/test_preview_hf_page.py
+++ b/tests/scripts/test_preview_hf_page.py
@@ -49,7 +49,12 @@
 # ``test_preview_kaggle_page.py`` for rationale.
 _LINK_OK_PREFIXES = (
     "https://github.com/leadforge-dev/leadforge",
+    "https://github.com/shaypalachy",
     "https://huggingface.co/datasets/leadforge",
+    "https://huggingface.co/datasets/shaypal5",
+    "https://huggingface.co/shaypal5",
+    "https://www.kaggle.com/derelictpanda",
+    "https://www.shaypalachy.com/",
     "https://example.com",
     "LICENSE",
     "#",