Project Ne10
An open, optimized software library for the ARM architecture.
|
Variables | |
ne10_result_t(* | ne10_mulc_float )(ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count) |
Multiplies all scalar elements of an input array by a constant value, storing the results in an output array. More... | |
ne10_result_t(* | ne10_mulc_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 2D vectors in an input array by a constant vector, storing the results in an output array. More... | |
ne10_result_t(* | ne10_mulc_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 3D vectors in an input array by a constant vector, storing the results in an output array. More... | |
ne10_result_t(* | ne10_mulc_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 4D vectors in an input array by a constant vector, storing the results in an output array. More... | |
ne10_result_t(* | ne10_mul_float )(ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) |
Multiplies the scalar elements of one input array by those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_vmul_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 2D vectors of one input array by those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_vmul_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 3D vectors of one input array by those of the same index in another, storing the results in an output array. More... | |
ne10_result_t(* | ne10_vmul_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 4D vectors of one input array by those of the same index in another, storing the results in an output array. More... | |
ne10_result_t ne10_mul_float_asm | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mul_float using NEON assembly.
ne10_result_t ne10_mul_float_c | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mul_float using plain C code.
Definition at line 37 of file NE10_mul.c.
ne10_result_t ne10_mul_float_neon | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src1, | ||
ne10_float32_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mul_float using NEON intrinsics.
ne10_result_t ne10_mulc_float_asm | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_float using NEON assembly.
ne10_result_t ne10_mulc_float_c | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_float using plain C code.
Definition at line 37 of file NE10_mulc.c.
ne10_result_t ne10_mulc_float_neon | ( | ne10_float32_t * | dst, |
ne10_float32_t * | src, | ||
const ne10_float32_t | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_float using NEON intrinsics.
Definition at line 39 of file NE10_mulc.neon.c.
ne10_result_t ne10_mulc_vec2f_asm | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec2f using NEON assembly.
ne10_result_t ne10_mulc_vec2f_c | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec2f using plain C code.
Definition at line 47 of file NE10_mulc.c.
ne10_result_t ne10_mulc_vec2f_neon | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src, | ||
const ne10_vec2f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec2f using NEON intrinsics.
Definition at line 49 of file NE10_mulc.neon.c.
ne10_result_t ne10_mulc_vec3f_asm | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec3f using NEON assembly.
ne10_result_t ne10_mulc_vec3f_c | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec3f using plain C code.
Definition at line 58 of file NE10_mulc.c.
ne10_result_t ne10_mulc_vec3f_neon | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src, | ||
const ne10_vec3f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec3f using NEON intrinsics.
Definition at line 59 of file NE10_mulc.neon.c.
ne10_result_t ne10_mulc_vec4f_asm | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec4f using NEON assembly.
ne10_result_t ne10_mulc_vec4f_c | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec4f using plain C code.
Definition at line 70 of file NE10_mulc.c.
ne10_result_t ne10_mulc_vec4f_neon | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src, | ||
const ne10_vec4f_t * | cst, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_mulc_vec4f using NEON intrinsics.
Definition at line 73 of file NE10_mulc.neon.c.
ne10_result_t ne10_vmul_vec2f_asm | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec2f using NEON assembly.
ne10_result_t ne10_vmul_vec2f_c | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec2f using plain C code.
Definition at line 47 of file NE10_mul.c.
ne10_result_t ne10_vmul_vec2f_neon | ( | ne10_vec2f_t * | dst, |
ne10_vec2f_t * | src1, | ||
ne10_vec2f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec2f using NEON intrinsics.
ne10_result_t ne10_vmul_vec3f_asm | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec3f using NEON assembly.
ne10_result_t ne10_vmul_vec3f_c | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec3f using plain C code.
Definition at line 58 of file NE10_mul.c.
ne10_result_t ne10_vmul_vec3f_neon | ( | ne10_vec3f_t * | dst, |
ne10_vec3f_t * | src1, | ||
ne10_vec3f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec3f using NEON intrinsics.
ne10_result_t ne10_vmul_vec4f_asm | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec4f using NEON assembly.
ne10_result_t ne10_vmul_vec4f_c | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec4f using plain C code.
Definition at line 70 of file NE10_mul.c.
ne10_result_t ne10_vmul_vec4f_neon | ( | ne10_vec4f_t * | dst, |
ne10_vec4f_t * | src1, | ||
ne10_vec4f_t * | src2, | ||
ne10_uint32_t | count | ||
) |
Specific implementation of ne10_vmul_vec4f using NEON intrinsics.
ne10_result_t(* ne10_mul_float) (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) |
Multiplies the scalar elements of one input array by those of the same index in another, storing the results in an output array.
Points to ne10_mul_float_c, ne10_mul_float_neon, or ne10_mul_float_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of multiplications to be performed (i.e. the length of the source arrays) |
Definition at line 250 of file NE10_init_math.c.
ne10_result_t(* ne10_mulc_float) (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count) |
Multiplies all scalar elements of an input array by a constant value, storing the results in an output array.
Points to ne10_mulc_float_c, ne10_mulc_float_neon, or ne10_mulc_float_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | The constant to multiply by |
[in] | count | The number of scalar values to be processed |
Definition at line 232 of file NE10_init_math.c.
ne10_result_t(* ne10_mulc_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 2D vectors in an input array by a constant vector, storing the results in an output array.
Points to ne10_mulc_vec2f_c, ne10_mulc_vec2f_neon, or ne10_mulc_vec2f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 2D vector to multiply by |
[in] | count | The number of 2D vectors to be processed |
Definition at line 233 of file NE10_init_math.c.
ne10_result_t(* ne10_mulc_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 3D vectors in an input array by a constant vector, storing the results in an output array.
Points to ne10_mulc_vec3f_c, ne10_mulc_vec3f_neon, or ne10_mulc_vec3f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 3D vector to multiply by |
[in] | count | The number of 3D vectors to be processed |
Definition at line 234 of file NE10_init_math.c.
ne10_result_t(* ne10_mulc_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count) |
Element-wise multiplies all the 4D vectors in an input array by a constant vector, storing the results in an output array.
Points to ne10_mulc_vec4f_c, ne10_mulc_vec4f_neon, or ne10_mulc_vec4f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src | Pointer to the source array |
[in] | cst | Pointer to the constant 4D vector to multiply by |
[in] | count | The number of 4D vectors to be processed |
Definition at line 235 of file NE10_init_math.c.
ne10_result_t(* ne10_vmul_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 2D vectors of one input array by those of the same index in another, storing the results in an output array.
Points to ne10_vmul_vec2f_c, ne10_vmul_vec2f_neon, or ne10_vmul_vec2f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of multiplications to be performed (i.e. the length of the source arrays) |
Definition at line 264 of file NE10_init_math.c.
ne10_result_t(* ne10_vmul_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 3D vectors of one input array by those of the same index in another, storing the results in an output array.
Points to ne10_vmul_vec3f_c, ne10_vmul_vec3f_neon, or ne10_vmul_vec3f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of multiplications to be performed (i.e. the length of the source arrays) |
Definition at line 265 of file NE10_init_math.c.
ne10_result_t(* ne10_vmul_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) |
Element-wise multiplies the 4D vectors of one input array by those of the same index in another, storing the results in an output array.
Points to ne10_vmul_vec4f_c, ne10_vmul_vec4f_neon, or ne10_vmul_vec4f_asm. This operation can be performed in-place.
[out] | dst | Pointer to the destination array |
[in] | src1 | Pointer to the first source array |
[in] | src2 | Pointer to the second source array |
[in] | count | The number of multiplications to be performed (i.e. the length of the source arrays) |
Definition at line 266 of file NE10_init_math.c.