Project Ne10
An open, optimized software library for the ARM architecture.
Functions | Variables
Vector Addition

Functions

ne10_result_t ne10_addc_float_c (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_float using plain C code. More...
 
ne10_result_t ne10_addc_float_neon (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count) asm("ne10_addc_float_neon")
 Specific implementation of ne10_addc_float using NEON intrinsics. More...
 
ne10_result_t ne10_addc_float_asm (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_float using NEON assembly. More...
 
ne10_result_t ne10_addc_vec2f_c (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec2f using plain C code. More...
 
ne10_result_t ne10_addc_vec2f_neon (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count) asm("ne10_addc_vec2f_neon")
 Specific implementation of ne10_addc_vec2f using NEON intrinsics. More...
 
ne10_result_t ne10_addc_vec2f_asm (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec2f using NEON assembly. More...
 
ne10_result_t ne10_addc_vec3f_c (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec3f using plain C code. More...
 
ne10_result_t ne10_addc_vec3f_neon (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count) asm("ne10_addc_vec3f_neon")
 Specific implementation of ne10_addc_vec3f using NEON intrinsics. More...
 
ne10_result_t ne10_addc_vec3f_asm (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec3f using NEON assembly. More...
 
ne10_result_t ne10_addc_vec4f_c (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec4f using plain C code. More...
 
ne10_result_t ne10_addc_vec4f_neon (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count) asm("ne10_addc_vec4f_neon")
 Specific implementation of ne10_addc_vec4f using NEON intrinsics. More...
 
ne10_result_t ne10_addc_vec4f_asm (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_addc_vec4f using NEON assembly. More...
 
ne10_result_t ne10_add_float_c (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_float using plain C code. More...
 
ne10_result_t ne10_add_float_neon (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) asm("ne10_add_float_neon")
 Specific implementation of ne10_add_float using NEON intrinsics. More...
 
ne10_result_t ne10_add_float_asm (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_float using NEON assembly. More...
 
ne10_result_t ne10_add_vec2f_c (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec2f using plain C code. More...
 
ne10_result_t ne10_add_vec2f_neon (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) asm("ne10_add_vec2f_neon")
 Specific implementation of ne10_add_vec2f using NEON intrinsics. More...
 
ne10_result_t ne10_add_vec2f_asm (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec2f using NEON assembly. More...
 
ne10_result_t ne10_add_vec3f_c (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec3f using plain C code. More...
 
ne10_result_t ne10_add_vec3f_neon (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) asm("ne10_add_vec3f_neon")
 Specific implementation of ne10_add_vec3f using NEON intrinsics. More...
 
ne10_result_t ne10_add_vec3f_asm (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec3f using NEON assembly. More...
 
ne10_result_t ne10_add_vec4f_c (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec4f using plain C code. More...
 
ne10_result_t ne10_add_vec4f_neon (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) asm("ne10_add_vec4f_neon")
 Specific implementation of ne10_add_vec4f using NEON intrinsics. More...
 
ne10_result_t ne10_add_vec4f_asm (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_add_vec4f using NEON assembly. More...
 

Variables

ne10_result_t(* ne10_addc_float )(ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Adds a constant scalar value to all elements of an input array, storing the results in an output array. More...
 
ne10_result_t(* ne10_addc_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Adds a constant 2D vector to all the vectors in an input array, storing the results in an output array. More...
 
ne10_result_t(* ne10_addc_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Adds a constant 3D vector to all the vectors in an input array, storing the results in an output array. More...
 
ne10_result_t(* ne10_addc_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Adds a constant 4D vector to all the vectors in an input array, storing the results in an output array. More...
 
ne10_result_t(* ne10_add_float )(ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Adds the scalar elements of one input array with those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_add_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Adds the 2D vectors of one input array with those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_add_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Adds the 3D vectors of one input array with those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_add_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Adds the 4D vectors of one input array with those of the same index in another, storing the results in an output array. More...
 

Detailed Description

These functions implement vector addition operations for single precision floating point values.

Function Documentation

ne10_result_t ne10_add_float_asm ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_float using NEON assembly.

ne10_result_t ne10_add_float_c ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_float using plain C code.

Definition at line 38 of file NE10_add.c.

ne10_result_t ne10_add_float_neon ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_float using NEON intrinsics.

ne10_result_t ne10_add_vec2f_asm ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec2f using NEON assembly.

ne10_result_t ne10_add_vec2f_c ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec2f using plain C code.

Definition at line 48 of file NE10_add.c.

ne10_result_t ne10_add_vec2f_neon ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec2f using NEON intrinsics.

ne10_result_t ne10_add_vec3f_asm ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec3f using NEON assembly.

ne10_result_t ne10_add_vec3f_c ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec3f using plain C code.

Definition at line 59 of file NE10_add.c.

ne10_result_t ne10_add_vec3f_neon ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec3f using NEON intrinsics.

ne10_result_t ne10_add_vec4f_asm ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec4f using NEON assembly.

ne10_result_t ne10_add_vec4f_c ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec4f using plain C code.

Definition at line 71 of file NE10_add.c.

ne10_result_t ne10_add_vec4f_neon ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_add_vec4f using NEON intrinsics.

ne10_result_t ne10_addc_float_asm ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_float using NEON assembly.

ne10_result_t ne10_addc_float_c ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_float using plain C code.

Examples:
NE10_sample_intro.c.

Definition at line 37 of file NE10_addc.c.

ne10_result_t ne10_addc_float_neon ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_float using NEON intrinsics.

Examples:
NE10_sample_intro.c.

Definition at line 39 of file NE10_addc.neon.c.

ne10_result_t ne10_addc_vec2f_asm ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec2f using NEON assembly.

ne10_result_t ne10_addc_vec2f_c ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec2f using plain C code.

Definition at line 47 of file NE10_addc.c.

ne10_result_t ne10_addc_vec2f_neon ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec2f using NEON intrinsics.

Definition at line 49 of file NE10_addc.neon.c.

ne10_result_t ne10_addc_vec3f_asm ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec3f using NEON assembly.

ne10_result_t ne10_addc_vec3f_c ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec3f using plain C code.

Definition at line 58 of file NE10_addc.c.

ne10_result_t ne10_addc_vec3f_neon ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec3f using NEON intrinsics.

Definition at line 59 of file NE10_addc.neon.c.

ne10_result_t ne10_addc_vec4f_asm ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec4f using NEON assembly.

ne10_result_t ne10_addc_vec4f_c ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec4f using plain C code.

Definition at line 70 of file NE10_addc.c.

ne10_result_t ne10_addc_vec4f_neon ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_addc_vec4f using NEON intrinsics.

Definition at line 73 of file NE10_addc.neon.c.

Variable Documentation

ne10_result_t(* ne10_add_float) (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)

Adds the scalar elements of one input array with those of the same index in another, storing the results in an output array.

Points to ne10_add_float_c, ne10_add_float_neon, or ne10_add_float_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of additions to be performed (i.e. the length of the source arrays)

Definition at line 248 of file NE10_init_math.c.

ne10_result_t(* ne10_add_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Adds the 2D vectors of one input array with those of the same index in another, storing the results in an output array.

Points to ne10_add_vec2f_c, ne10_add_vec2f_neon, or ne10_add_vec2f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of additions to be performed (i.e. the length of the source arrays)

Definition at line 273 of file NE10_init_math.c.

ne10_result_t(* ne10_add_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Adds the 3D vectors of one input array with those of the same index in another, storing the results in an output array.

Points to ne10_add_vec3f_c, ne10_add_vec3f_neon, or ne10_add_vec3f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of additions to be performed (i.e. the length of the source arrays)

Definition at line 274 of file NE10_init_math.c.

ne10_result_t(* ne10_add_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Adds the 4D vectors of one input array with those of the same index in another, storing the results in an output array.

Points to ne10_add_vec4f_c, ne10_add_vec4f_neon, or ne10_add_vec4f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of additions to be performed (i.e. the length of the source arrays)

Definition at line 275 of file NE10_init_math.c.

ne10_result_t(* ne10_addc_float) (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)

Adds a constant scalar value to all elements of an input array, storing the results in an output array.

Points to ne10_addc_float_c, ne10_addc_float_neon, or ne10_addc_float_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstThe constant to add
[in]countThe number of scalar values to be processed
Examples:
NE10_sample_intro.c.

Definition at line 220 of file NE10_init_math.c.

ne10_result_t(* ne10_addc_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)

Adds a constant 2D vector to all the vectors in an input array, storing the results in an output array.

Points to ne10_addc_vec2f_c, ne10_addc_vec2f_neon, or ne10_addc_vec2f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 2D vector to add
[in]countThe number of 2D vectors to be processed

Definition at line 221 of file NE10_init_math.c.

ne10_result_t(* ne10_addc_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)

Adds a constant 3D vector to all the vectors in an input array, storing the results in an output array.

Points to ne10_addc_vec3f_c, ne10_addc_vec3f_neon, or ne10_addc_vec3f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 3D vector to add
[in]countThe number of 3D vectors to be processed

Definition at line 222 of file NE10_init_math.c.

ne10_result_t(* ne10_addc_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)

Adds a constant 4D vector to all the vectors in an input array, storing the results in an output array.

Points to ne10_addc_vec4f_c, ne10_addc_vec4f_neon, or ne10_addc_vec4f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 4D vector to add
[in]countThe number of 4D vectors to be processed

Definition at line 223 of file NE10_init_math.c.