Project Ne10
An open, optimized software library for the ARM architecture.
Vector Dot Product

Functions

ne10_result_t ne10_dot_vec2f_c (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec2f using plain C code.
 
ne10_result_t ne10_dot_vec2f_neon (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) asm("ne10_dot_vec2f_neon")
 Specific implementation of ne10_dot_vec2f using NEON intrinsics.
 
ne10_result_t ne10_dot_vec2f_asm (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec2f using NEON assembly.
 
ne10_result_t ne10_dot_vec3f_c (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec3f using plain C code.
 
ne10_result_t ne10_dot_vec3f_neon (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) asm("ne10_dot_vec3f_neon")
 Specific implementation of ne10_dot_vec3f using NEON intrinsics.
 
ne10_result_t ne10_dot_vec3f_asm (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec3f using NEON assembly.
 
ne10_result_t ne10_dot_vec4f_c (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec4f using plain C code.
 
ne10_result_t ne10_dot_vec4f_neon (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) asm("ne10_dot_vec4f_neon")
 Specific implementation of ne10_dot_vec4f using NEON intrinsics.
 
ne10_result_t ne10_dot_vec4f_asm (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_dot_vec4f using NEON assembly.
 

Variables

ne10_result_t(* ne10_dot_vec2f )(ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Calculates the dot product of 2D vectors from one input array with those of the same index in another, storing the results in an output array.
 
ne10_result_t(* ne10_dot_vec3f )(ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Calculates the dot product of 3D vectors from one input array with those of the same index in another, storing the results in an output array.
 
ne10_result_t(* ne10_dot_vec4f )(ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Calculates the dot product of 4D vectors from one input array with those of the same index in another, storing the results in an output array.
 

Detailed Description

These functions implement vector dot product operations for single-precision floating-point values.
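As a point of reference, the operation can be sketched in portable C. The type `vec2f` and the functions `dot_vec2f_ref` and `dot_vec2f_ref_example` below are hypothetical stand-ins invented for this illustration, not Ne10's own definitions. The sketch assumes only that each 2D vector holds two single-precision components and that the result for index i is written to dst[i]; the return codes are likewise an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a 2D single-precision vector (not the Ne10 type). */
typedef struct { float x; float y; } vec2f;

/* Reference loop: dst[i] = src1[i] . src2[i] for every i in [0, count). */
static int dot_vec2f_ref(float *dst, const vec2f *src1, const vec2f *src2,
                         uint32_t count)
{
    if (dst == NULL || src1 == NULL || src2 == NULL)
        return -1;  /* illustrative error code, assumed for this sketch */
    for (uint32_t i = 0; i < count; i++)
        dst[i] = src1[i].x * src2[i].x + src1[i].y * src2[i].y;
    return 0;
}

/* Small self-check using values that are exactly representable as floats. */
static void dot_vec2f_ref_example(void)
{
    const vec2f a[2] = { {1.0f, 2.0f}, {0.5f, -1.0f} };
    const vec2f b[2] = { {3.0f, 4.0f}, {2.0f,  2.0f} };
    float out[2];

    assert(dot_vec2f_ref(out, a, b, 2) == 0);
    assert(out[0] == 11.0f);  /* 1*3 + 2*4 */
    assert(out[1] == -1.0f);  /* 0.5*2 + (-1)*2 */
}
```

Calling `dot_vec2f_ref_example()` from a `main` function exercises the loop on two vector pairs.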

Function Documentation

ne10_result_t ne10_dot_vec2f_asm (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec2f using NEON assembly.

ne10_result_t ne10_dot_vec2f_c (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec2f using plain C code.

Definition at line 37 of file NE10_dot.c.

ne10_result_t ne10_dot_vec2f_neon (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec2f using NEON intrinsics.

ne10_result_t ne10_dot_vec3f_asm (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec3f using NEON assembly.

ne10_result_t ne10_dot_vec3f_c (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec3f using plain C code.

Definition at line 48 of file NE10_dot.c.

ne10_result_t ne10_dot_vec3f_neon (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec3f using NEON intrinsics.

ne10_result_t ne10_dot_vec4f_asm (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec4f using NEON assembly.

ne10_result_t ne10_dot_vec4f_c (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec4f using plain C code.

Definition at line 60 of file NE10_dot.c.

ne10_result_t ne10_dot_vec4f_neon (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Specific implementation of ne10_dot_vec4f using NEON intrinsics.

Variable Documentation

ne10_result_t(* ne10_dot_vec2f) (ne10_float32_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Calculates the dot product of 2D vectors from one input array with those of the same index in another, storing the results in an output array.

Points to ne10_dot_vec2f_c, ne10_dot_vec2f_neon, or ne10_dot_vec2f_asm.

Parameters
[out] dst    Pointer to the destination array
[in]  src1   Pointer to the first source array
[in]  src2   Pointer to the second source array
[in]  count  The number of dot products to be performed (i.e. the length of the source arrays)

Definition at line 279 of file NE10_init_math.c.
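Because ne10_dot_vec2f is a function pointer that refers to one of several implementations, callers go through a run-time dispatch. The sketch below illustrates that general pattern only. All names in it (`vec2f`, `dot2_fn`, `dot2_plain`, `dot2_unrolled`, `dot2`, `select_dot2`, `has_simd`) are hypothetical, it does not reproduce Ne10's initialization code, and the "unrolled" variant is an ordinary C loop standing in for an optimized implementation.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef struct { float x; float y; } vec2f;  /* hypothetical stand-in type */

/* Common signature shared by every implementation in this sketch. */
typedef int (*dot2_fn)(float *dst, const vec2f *src1, const vec2f *src2,
                       uint32_t count);

/* Straightforward one-vector-per-iteration implementation. */
static int dot2_plain(float *dst, const vec2f *src1, const vec2f *src2,
                      uint32_t count)
{
    for (uint32_t i = 0; i < count; i++)
        dst[i] = src1[i].x * src2[i].x + src1[i].y * src2[i].y;
    return 0;
}

/* Stand-in for an optimized variant: two vectors per iteration plus a tail. */
static int dot2_unrolled(float *dst, const vec2f *src1, const vec2f *src2,
                         uint32_t count)
{
    uint32_t i = 0;
    for (; i + 1 < count; i += 2) {
        dst[i]     = src1[i].x * src2[i].x + src1[i].y * src2[i].y;
        dst[i + 1] = src1[i + 1].x * src2[i + 1].x
                   + src1[i + 1].y * src2[i + 1].y;
    }
    if (i < count)  /* odd count: handle the last vector */
        dst[i] = src1[i].x * src2[i].x + src1[i].y * src2[i].y;
    return 0;
}

/* The dispatch pointer; callers use dot2(...) without knowing the target. */
static dot2_fn dot2 = NULL;

/* Selects an implementation once, e.g. at start-up. */
static void select_dot2(int has_simd)
{
    dot2 = has_simd ? dot2_unrolled : dot2_plain;
    assert(dot2 != NULL);
}
```

Both variants share one signature, so swapping the pointer changes the implementation without touching any call site.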

ne10_result_t(* ne10_dot_vec3f) (ne10_float32_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Calculates the dot product of 3D vectors from one input array with those of the same index in another, storing the results in an output array.

Points to ne10_dot_vec3f_c, ne10_dot_vec3f_neon, or ne10_dot_vec3f_asm.

Parameters
[out] dst    Pointer to the destination array
[in]  src1   Pointer to the first source array
[in]  src2   Pointer to the second source array
[in]  count  The number of dot products to be performed (i.e. the length of the source arrays)

Definition at line 280 of file NE10_init_math.c.
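For the 3D case, a worked value helps when checking results: (1, 2, 3) . (4, 5, 6) = 1*4 + 2*5 + 3*6 = 32. A minimal plain-C sketch of the same computation follows. The names `vec3f` and `dot_vec3f_ref` are hypothetical and chosen for this illustration, and the sketch assumes three single-precision components per vector.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a 3D single-precision vector (not the Ne10 type). */
typedef struct { float x; float y; float z; } vec3f;

/* dst[i] = src1[i] . src2[i] for every i in [0, count). */
static void dot_vec3f_ref(float *dst, const vec3f *src1, const vec3f *src2,
                          uint32_t count)
{
    assert(dst && src1 && src2);
    for (uint32_t i = 0; i < count; i++)
        dst[i] = src1[i].x * src2[i].x
               + src1[i].y * src2[i].y
               + src1[i].z * src2[i].z;
}
```

Perpendicular unit vectors such as (1, 0, 0) and (0, 1, 0) yield 0, which makes a convenient second test case.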

ne10_result_t(* ne10_dot_vec4f) (ne10_float32_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Calculates the dot product of 4D vectors from one input array with those of the same index in another, storing the results in an output array.

Points to ne10_dot_vec4f_c, ne10_dot_vec4f_neon, or ne10_dot_vec4f_asm.

Parameters
[out] dst    Pointer to the destination array
[in]  src1   Pointer to the first source array
[in]  src2   Pointer to the second source array
[in]  count  The number of dot products to be performed (i.e. the length of the source arrays)

Definition at line 281 of file NE10_init_math.c.
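Note that count is the number of vectors, not the number of floats: each source array holds count 4D vectors, while the destination holds count scalars. The sketch below makes those sizes explicit. The names `vec4f` and `dot_vec4f_ref` are hypothetical, and the sketch assumes four single-precision components per vector.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a 4D single-precision vector (not the Ne10 type). */
typedef struct { float x; float y; float z; float w; } vec4f;

/* Reads count vectors from each source and writes count floats to dst. */
static void dot_vec4f_ref(float *dst, const vec4f *src1, const vec4f *src2,
                          uint32_t count)
{
    assert(dst && src1 && src2);
    for (uint32_t i = 0; i < count; i++)
        dst[i] = src1[i].x * src2[i].x + src1[i].y * src2[i].y
               + src1[i].z * src2[i].z + src1[i].w * src2[i].w;
}
```

With count = 2, for example, each source array must provide two vectors (eight floats in total) and dst needs room for two floats.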