Populate "high level" stl like algorithm #107

ThomasRetornaz · 2018-01-06T12:49:20Z

Hi
I currently migrate from boost:simd to libsimdpp
I heavily use transform and reduce algorithm from plain pointers and simd aware operators
I will try to implement such algorithm using libsimdpp
Are you interested if i providing such "high level" algorithm to the main library?

Possible signature for transform could be
template<typename T, typename U, typename UnOp> U* transform(T const* first, T const* last, U* out, UnOp f) { ..... }
where UnOp should be designed by users to handle both litterals and "simd vectors"

Transform functions must handle

prelude (if they are not musch element to fit in simd register)
main simd part (element wich fit in simd register and use simd load/store)
epilogue (remaining element which not fit in simd register)

I will add

few traits to pick up the most reliable number of elements for simd part
isaligned function to switch beetween load load_u and store and store_u (which seem missing)
transform and reduce algorithm (for a begining)

Do you have any concern where i should put those different functions
I will make a pull request if you are interested

The text was updated successfully, but these errors were encountered:

p12tic · 2018-01-08T00:06:13Z

Hi, thanks for interest. It would be great to have this functionality in libsimdpp.

Do you have any concern where i should put those different functions

I think it doesn't matter as it's not hard to move code and libsimdpp currently does not expose the location of individual headers. At the beginning we could put generic algorithms to simdpp/algirothm folder and see later if there's better place.

@xugng FYI. Do you already work on something like this by chance?

p12tic · 2018-01-14T19:29:37Z

@xugng: Ping :-)

xugng · 2018-01-15T16:43:12Z

@p12tic, @ThomasRetornaz : No, I not working on this. Feel free to hack.

ThomasRetornaz · 2018-01-20T06:14:33Z

HI i will make a pull request on std like transform algorithm but i have a few concern

I can't generate the documentation and check my additions because http://doc.radix.lt/libsimdpp/ is unreachable
i have to add a define call SIMDPP_IDEAL_MAX_ALIGN_BYTES (a la EIGEN) to dispatch on best default alignement depending on literal type. Does it seems to make sens to you (see below)? or other define preexist and/or these information is provided somewhere ?.

#if SIMDPP_USE_NULL
#define SIMDPP_IDEAL_MAX_ALIGN_BYTES 1
#elif SIMDPP_USE_AVX512F
#define SIMDPP_IDEAL_MAX_ALIGN_BYTES 64
#elif SIMDPP_USE_AVX
#define SIMDPP_IDEAL_MAX_ALIGN_BYTES 32
#else
#define SIMDPP_IDEAL_MAX_ALIGN_BYTES 16
#endif

/// TypeTraits int8_t
template<>
struct TypeTraits <int8_t>
{
static const size_t SIMDPP_FAST_SIZE = SIMDPP_FAST_INT8_SIZE;
using simd_type = int8<SIMDPP_FAST_SIZE>;
static const size_t alignement = SIMDPP_IDEAL_MAX_ALIGN_BYTES;
};

Regards
TR

p12tic · 2018-01-21T19:44:20Z

I can't generate the documentation and check my additions because http://doc.radix.lt/libsimdpp/ is unreachable

I disabled public access to it due to hacking concerns. Could you email me at [email protected] and I'll send you instructions to access it and credentials needed for that.

i have to add a define call SIMDPP_IDEAL_MAX_ALIGN_BYTES <...>

The ideal alignment should differ per type - e.g. on AVX integer types only need to be 128-bit aligned whereas float types need to be 256-bit aligned. The alignment could be specified directly in the TypeTraits specializations, e.g. static const size_t alignment = 1 * fast_size.

Also a couple of naming nitpicks: TypeTraits => type_traits, SIMDPP_FAST_SIZE => fast_size.

Does that make sense to you?

Thanks!

ThomasRetornaz · 2018-01-22T16:38:21Z

disabled public access to it due to hacking concerns. Could you email me at [email protected] and I'll send you instructions to access it and credentials needed for that.

Thanks i will send an email

The ideal alignment should differ per type - e.g. on AVX integer types only need to be 128-bit aligned whereas float types need to be 256-bit aligned. The alignment could be specified directly in the TypeTraits specializations, e.g. static const size_t alignment = 1 * fast_size.

Ok i miss this. I'm new on avx/avx2 instructions sets sorry ...
Nevertheless alignement can't be equal to 1 * fast_size. If i understand it should be 32 bytes for float types and 16 bytes for interger types on AVX or fast_size==4 for double and ==8 for float which it make sens if fast_size code the "best possible size" for simd pack
I don't found a macro and/or mathematical operation which could link fast_size and "alignement" in AVX case
Do i need to make a "dispatch" regarding arch in typetraits to handle this?
May i miss something stupid
Regards
TR

ThomasRetornaz · 2018-01-29T06:51:43Z

The ideal alignment should differ per type - e.g. on AVX integer types only need to be 128-bit aligned whereas float types need to be 256-bit aligned. The alignment could be specified directly in the TypeTraits specializations, e.g. static const size_t alignment = 1 * fast_size.

Hi i converge to this

`

  /// Define typetraits  
    template<class valuetype>
    struct typetraits
    {
        static const size_t alignment = std::alignment_of<valuetype>::value; 
    };

    /// typetraits int8_t
    template<>
    struct typetraits <int8_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT8_SIZE;
        using simd_type = int8<fast_size>;
        static const size_t alignment = fast_size;
    };
    /// typetraits uint8_t
    template<>
    struct typetraits <uint8_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT8_SIZE;
        using simd_type = uint8<fast_size>;
        static const size_t alignment = fast_size;
    };

    /// typetraits int16_t
    template<>
    struct typetraits <int16_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT16_SIZE;
        using simd_type = int16<fast_size>;
        static const size_t alignment = fast_size * 2;
    };
    /// typetraits uint16_t
    template<>
    struct typetraits <uint16_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT16_SIZE;
        using simd_type = uint16<fast_size>;
        static const size_t alignment = fast_size * 2;
    };

    /// typetraits int32_t
    template<>
    struct typetraits <int32_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT32_SIZE;
        using simd_type = int32<fast_size>;
        static const size_t alignment = fast_size * 4;
    };
    /// typetraits uint32_t
    template<>
    struct typetraits <uint32_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT32_SIZE;
        using simd_type = uint32<fast_size>;
        static const size_t alignment = fast_size * 4;
    };

    /// typetraits int64_t
    template<>
    struct typetraits <int64_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT64_SIZE;
        using simd_type = int64<fast_size>;
        static const size_t alignment = fast_size * 8;
    };

    /// typetraits uint64_t
    template<>
    struct typetraits <uint64_t>
    {
        static const size_t fast_size = SIMDPP_FAST_INT64_SIZE;
        using simd_type = uint64<fast_size>;
        static const size_t alignment = fast_size * 8;
    };

    /// typetraits float32
    template<>
    struct typetraits <float>
    {
        static const size_t fast_size = SIMDPP_FAST_FLOAT32_SIZE;
        using simd_type = float32<fast_size>;
        static const size_t alignment = fast_size * 4;
    };

    /// typetraits float64
    template<>
    struct typetraits <double>
    {
        static const size_t fast_size = SIMDPP_FAST_FLOAT64_SIZE;
        using simd_type = float64<fast_size>;
        static const size_t alignment = fast_size * 8;
    };`

It seems to do the job

I disabled public access to it due to hacking concerns. Could you email me at [email protected] and I'll send you instructions to access it and credentials needed for that.

If you have a time i will check my documentation and make a pull request for std like transform and reduce

By the way do you think over stl like algorithm could be usefull for the library? If i have time i will work on it
Regards
TR

Follow review * fix indent * add "fuzzing" tests for all algorithm * add TEST_EQUAL_COLLECTIONS * add nrt helpers for generating data (to be moved elsewhere ?)

* Try to fix visual 2013/2015 compilation issues * enforce const/inline and noexcept for predicate

p12tic added the enhancement label Jan 14, 2018

p12tic mentioned this issue Jan 14, 2018

incorrect documentation and additional suggestions #108

Open

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Feb 26, 2018

wip issue p12tic#107 add transform/reduce algorithm

8d91b7e

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Feb 26, 2018

issue p12tic#107 add fill,copy,copy_n algorithm

c679f81

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Feb 28, 2018

issue p12tic#107 gcc compil fix

4bc2c63

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 5, 2018

issue p12tic#107 add search max/min

22e5357

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 5, 2018

issue p12tic#107 add find,find_if,find_if_not

5c48da0

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 5, 2018

issue p12tic#107 fix gcc and release mode for find*

ab3e92b

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 6, 2018

issue p12tic#107 add max_element and min_element

b0735f5

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 6, 2018

issue p12tic#107 gcc compil/warning fix

0025a8f

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 7, 2018

issue p12tic#107 add count, count_if

ae48025

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 7, 2018

issue p12tic#107 add all_of, any_of, none_of

dc33d00

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 8, 2018

issue p12tic#107 add replace,replace_if

d6a6bfa

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 10, 2018

issue p12tic#107 add equal and lexicographic_compare

f95aa05

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 11, 2018

issue p12tic#107 add transform_reduce

b8b0b34

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 11, 2018

issue p12tic#107 ras

179cc90

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Mar 11, 2018

issue p12tic#107 visual compilation fix

f57deb0

This was referenced Mar 16, 2018

Dev #114

Open

Use google benchmark as third partie dependencie for bench? #115

Open

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Apr 9, 2018

issue p12tic#107

3d9fb98

Follow review * fix indent * add "fuzzing" tests for all algorithm * add TEST_EQUAL_COLLECTIONS * add nrt helpers for generating data (to be moved elsewhere ?)

ThomasRetornaz pushed a commit to ThomasRetornaz/libsimdpp that referenced this issue Apr 9, 2018

issue p12tic#107 gcc and c++11 only compil fix

d8b2eda

ThomasRetornaz added a commit to ThomasRetornaz/libsimdpp that referenced this issue Apr 11, 2018

issue p12tic#107

9a3636a

* Try to fix visual 2013/2015 compilation issues * enforce const/inline and noexcept for predicate

ThomasRetornaz mentioned this issue Jul 20, 2018

[Load/Store] Very slow #124

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Populate "high level" stl like algorithm #107

Populate "high level" stl like algorithm #107

ThomasRetornaz commented Jan 6, 2018

p12tic commented Jan 8, 2018 •

edited

Loading

p12tic commented Jan 14, 2018

xugng commented Jan 15, 2018

ThomasRetornaz commented Jan 20, 2018

p12tic commented Jan 21, 2018

ThomasRetornaz commented Jan 22, 2018

ThomasRetornaz commented Jan 29, 2018 •

edited

Loading

Populate "high level" stl like algorithm #107

Populate "high level" stl like algorithm #107

Comments

ThomasRetornaz commented Jan 6, 2018

p12tic commented Jan 8, 2018 • edited Loading

p12tic commented Jan 14, 2018

xugng commented Jan 15, 2018

ThomasRetornaz commented Jan 20, 2018

p12tic commented Jan 21, 2018

ThomasRetornaz commented Jan 22, 2018

ThomasRetornaz commented Jan 29, 2018 • edited Loading

p12tic commented Jan 8, 2018 •

edited

Loading

ThomasRetornaz commented Jan 29, 2018 •

edited

Loading