Expanded math ops by ShrutheeshIR · Pull Request #114 · KavrakiLab/vamp

ShrutheeshIR · 2026-06-20T20:39:13Z

I want more math ops for fancier math. Main changes

Add asin, acos and atan inverse trigonometric functions in avx.hh. -- Use cephes implementations from here. Note that I have used _mm256_sqrt_ps ops here instead of the custom defined sqrt(), b/c the precision is pretty poor and acos has differences in the 3rd FP itself.
Add them to the interface as well
Expanded math.hh to include these ops and other operations that are useful in general

TODO --

Add them to neon
Add wasm

Tested it using

    size_t total_tests_failed = 0;
    std::cout << "Check atan2 implementation" << std::endl;
    for (auto i = -10; i <= 10; ++i)
    {
        for (auto j = -10; j <= 10; ++j)
        {
            if (i == 0 && j == 0)
            {
                continue;
            }
            float angle = std::atan2(static_cast<float>(i), static_cast<float>(j));

            // now convert it to a FloatVector<rake> and call atan
            vamp::FloatVector<rake> i_vec;
            i_vec[0] = static_cast<float>(i);
            vamp::FloatVector<rake> j_vec;
            j_vec[0] = static_cast<float>(j);
            auto angle_vec = atan2(i_vec, j_vec);

            if (std::abs(angle - angle_vec[{0, 0}]) > 1e-6)
            {
                std::cout << "Mismatch for (" << i << ", " << j << "): "
                          << "std::atan2 = " << angle << ", "
                          << "FloatVector<rake>::atan2 = " << angle_vec[{0, 0}] << std::endl;
                total_tests_failed++;
            }
        }
    }

    // check acos and asin   
    std::cout << "Check acos and asin implementation" << std::endl;
    for (auto i = -10; i <= 10; ++i)
    {
        float value = static_cast<float>(i) / 10.0f;  // values from -1.0 to 1.0
        if (value < -1.0f || value > 1.0f)
        {
            continue;  // skip out-of-range values
        }
        float angle = std::acos(value);
        float asin_angle = std::asin(value);

        vamp::FloatVector<rake> value_vec;
        value_vec[0] = value;
        auto angle_vec = acos(value_vec);
        auto asin_angle_vec = asin(value_vec);

        if (std::abs(angle - angle_vec[{0, 0}]) > 1e-6)
        {
            std::cout << "Mismatch for " << value << ": "
                      << "std::acos = " << angle << ", "
                      << "FloatVector<rake>::acos = " << angle_vec[{0, 0}] << std::endl;
            total_tests_failed++;
        }
        if (std::abs(asin_angle - asin_angle_vec[{0, 0}]) > 1e-6)
        {
            std::cout << "Mismatch for " << value << ": "
                      << "std::asin = " << asin_angle << ", "
                      << "FloatVector<rake>::asin = " << asin_angle_vec[{0, 0}] << std::endl;
            total_tests_failed++;
        }

    }
    std::cout << "Total tests failed : " << total_tests_failed << std::endl;

PS: The custom implementation of sqrt honestly is pretty bad for most use cases, I think the default behavior should be _mm256_sqrt_ps, and we can expose a sqrt_low_precision for other use cases (eg cc). This is the 2nd time I've wasted a couple of hours tracking down precision issues from my implementation only to realize sqrt is the culprit :(

ShrutheeshIR · 2026-06-20T21:13:11Z

+}
+
+template <typename DataA, typename DataB, typename DataC>
+inline constexpr auto blend(const DataA &a, const DataB &b, const DataC &mask ) -> DataC


Codegen can generate blend() ops, good to use it right

zkingston

Overall good, please add the NEON and WASM equivalents.

zkingston · 2026-06-20T23:15:32Z

+template <typename DataA, typename DataB, typename DataC>
+inline constexpr auto blend(const DataA &a, const DataB &b, const DataC &mask) -> DataC
+{
+    if constexpr (std::is_arithmetic_v<DataC>)
+    {
+        return (mask >= 0) ? a : b;
+    }
+    else
+    {
+        DataC a_vec(a);
+        DataC b_vec(b);
+        return a_vec.blend(b_vec, mask);
+    }
+}
+


What's this extra blend instruction?

Just a unified expr for scalar and a vectorized conditional expr
cppad sometimes spits out a
v[14] = blend(v[14], -0.9999, (v[14]) - (-0.9999)); I want to be able to handle that, so I'll just pin it to the datatype of the first arg

zkingston · 2026-06-22T18:37:27Z

Can you verify the formatting is correct and that all builds pass? Otherwise LGTM.

ShrutheeshIR added 2 commits June 20, 2026 16:27

Inverse trig ops

566bf96

fix acos sqrt call

57fc20b

ShrutheeshIR closed this Jun 20, 2026

Merge branch 'main' into expanded_math_ops

e6c582d

ShrutheeshIR reopened this Jun 20, 2026

ShrutheeshIR commented Jun 20, 2026

View reviewed changes

Comment thread src/impl/vamp/vector/isa/avx.hh

ShrutheeshIR commented Jun 20, 2026

View reviewed changes

clang formatted

cb92dc6

zkingston requested changes Jun 20, 2026

View reviewed changes

ShrutheeshIR added 6 commits June 21, 2026 15:26

got gpt to write me neon and wasm equivalents

ac0911a

fix wasm and neon

69b304d

fixed neon xor

99c1f90

fixed blend

f856ef6

modified blend return

85ae701

no andnot

6ca13b1

ShrutheeshIR requested a review from zkingston June 21, 2026 23:02

zkingston approved these changes Jun 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expanded math ops#114

Expanded math ops#114
ShrutheeshIR wants to merge 10 commits into
KavrakiLab:mainfrom
CoMMALab:expanded_math_ops

ShrutheeshIR commented Jun 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

ShrutheeshIR Jun 20, 2026

Uh oh!

zkingston left a comment

Uh oh!

zkingston Jun 20, 2026

Uh oh!

ShrutheeshIR Jun 21, 2026 •

edited

Loading

Uh oh!

zkingston commented Jun 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ShrutheeshIR commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ShrutheeshIR Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

zkingston left a comment

Choose a reason for hiding this comment

Uh oh!

zkingston Jun 20, 2026

Choose a reason for hiding this comment

Uh oh!

ShrutheeshIR Jun 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zkingston commented Jun 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ShrutheeshIR commented Jun 20, 2026 •

edited

Loading

ShrutheeshIR Jun 21, 2026 •

edited

Loading