Project Ne10
An open, optimized software library for the ARM architecture.
|
Variables | |
ne10_result_t(* | ne10_addc_float )(ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count) |
Adds a constant scalar value to all elements of an input array, storing the results in an output array. More... | |
ne10_result_t(* | ne10_addc_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count) |
Adds a constant 2D vector to all the vectors in an input array, storing the results in an output array. More... | |
ne10_result_t(* | ne10_addc_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count) |
Adds a constant 3D vector to all the vectors in an input array, storing the results in an output array. More... | |
ne10_result_t(* | ne10_addc_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count) |
Adds a constant 4D vector to all the vectors in an input array, storing the results in an output array. More... | |
ne10_result_t(* | ne10_add_float )(ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) |
Adds the scalar elements of one input array with those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_add_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) |
Adds the 2D vectors of one input array with those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_add_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) |
Adds the 3D vectors of one input array with those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_add_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) |
Adds the 4D vectors of one input array with those of the same index in another, storing the results in an output array. More... | |
ne10_result_t ne10_add_float_asm | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_float using NEON assembly.
ne10_result_t ne10_add_float_c | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_float using plain C code.
Definition at line 38 of file NE10_add.c.
ne10_result_t ne10_add_float_neon | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_float using NEON intrinsics.
ne10_result_t ne10_add_vec2f_asm | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec2f using NEON assembly.
ne10_result_t ne10_add_vec2f_c | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec2f using plain C code.
Definition at line 48 of file NE10_add.c.
ne10_result_t ne10_add_vec2f_neon | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec2f using NEON intrinsics.
ne10_result_t ne10_add_vec3f_asm | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec3f using NEON assembly.
ne10_result_t ne10_add_vec3f_c | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec3f using plain C code.
Definition at line 59 of file NE10_add.c.
ne10_result_t ne10_add_vec3f_neon | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec3f using NEON intrinsics.
ne10_result_t ne10_add_vec4f_asm | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec4f using NEON assembly.
ne10_result_t ne10_add_vec4f_c | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec4f using plain C code.
Definition at line 71 of file NE10_add.c.
ne10_result_t ne10_add_vec4f_neon | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_add_vec4f using NEON intrinsics.
ne10_result_t ne10_addc_float_asm | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_float using NEON assembly.
ne10_result_t ne10_addc_float_c | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_float using plain C code.
Definition at line 37 of file NE10_addc.c.
ne10_result_t ne10_addc_float_neon | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_float using NEON intrinsics.
Definition at line 39 of file NE10_addc.neon.c.
ne10_result_t ne10_addc_vec2f_asm | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec2f using NEON assembly.
ne10_result_t ne10_addc_vec2f_c | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec2f using plain C code.
Definition at line 47 of file NE10_addc.c.
ne10_result_t ne10_addc_vec2f_neon | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec2f using NEON intrinsics.
Definition at line 49 of file NE10_addc.neon.c.
ne10_result_t ne10_addc_vec3f_asm | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec3f using NEON assembly.
ne10_result_t ne10_addc_vec3f_c | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec3f using plain C code.
Definition at line 58 of file NE10_addc.c.
ne10_result_t ne10_addc_vec3f_neon | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec3f using NEON intrinsics.
Definition at line 59 of file NE10_addc.neon.c.
ne10_result_t ne10_addc_vec4f_asm | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec4f using NEON assembly.
ne10_result_t ne10_addc_vec4f_c | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec4f using plain C code.
Definition at line 70 of file NE10_addc.c.
ne10_result_t ne10_addc_vec4f_neon | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_addc_vec4f using NEON intrinsics.
Definition at line 73 of file NE10_addc.neon.c.
ne10_result_t(* ne10_add_float) (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) |
Adds the scalar elements of one input array with those of the same index in another, storing the results in an output array.
Points to ne10_add_float_c, ne10_add_float_neon, or ne10_add_float_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of additions to be performed (i.e. the length of the source arrays) |
Definition at line 248 of file NE10_init_math.c.
ne10_result_t(* ne10_add_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) |
Adds the 2D vectors of one input array with those of the same index in another, storing the results in an output array.
Points to ne10_add_vec2f_c, ne10_add_vec2f_neon, or ne10_add_vec2f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of additions to be performed (i.e. the length of the source arrays) |
Definition at line 273 of file NE10_init_math.c.
ne10_result_t(* ne10_add_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) |
Adds the 3D vectors of one input array with those of the same index in another, storing the results in an output array.
Points to ne10_add_vec3f_c, ne10_add_vec3f_neon, or ne10_add_vec3f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of additions to be performed (i.e. the length of the source arrays) |
Definition at line 274 of file NE10_init_math.c.
ne10_result_t(* ne10_add_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) |
Adds the 4D vectors of one input array with those of the same index in another, storing the results in an output array.
Points to ne10_add_vec4f_c, ne10_add_vec4f_neon, or ne10_add_vec4f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of additions to be performed (i.e. the length of the source arrays) |
Definition at line 275 of file NE10_init_math.c.
ne10_result_t(* ne10_addc_float) (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count) |
Adds a constant scalar value to all elements of an input array, storing the results in an output array.
Points to ne10_addc_float_c, ne10_addc_float_neon, or ne10_addc_float_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | The constant to add |
[in] | count | The number of scalar values to be processed |
Definition at line 220 of file NE10_init_math.c.
ne10_result_t(* ne10_addc_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count) |
Adds a constant 2D vector to all the vectors in an input array, storing the results in an output array.
Points to ne10_addc_vec2f_c, ne10_addc_vec2f_neon, or ne10_addc_vec2f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 2D vector to add |
[in] | count | The number of 2D vectors to be processed |
Definition at line 221 of file NE10_init_math.c.
ne10_result_t(* ne10_addc_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count) |
Adds a constant 3D vector to all the vectors in an input array, storing the results in an output array.
Points to ne10_addc_vec3f_c, ne10_addc_vec3f_neon, or ne10_addc_vec3f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 3D vector to add |
[in] | count | The number of 3D vectors to be processed |
Definition at line 222 of file NE10_init_math.c.
ne10_result_t(* ne10_addc_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count) |
Adds a constant 4D vector to all the vectors in an input array, storing the results in an output array.
Points to ne10_addc_vec4f_c, ne10_addc_vec4f_neon, or ne10_addc_vec4f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 4D vector to add |
[in] | count | The number of 4D vectors to be processed |
Definition at line 223 of file NE10_init_math.c.