Project Ne10
An open, optimized software library for the ARM architecture.
Functions | Variables
Vector Multiplication

Functions

ne10_result_t ne10_mulc_float_c (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_float using plain C code. More...
 
ne10_result_t ne10_mulc_float_neon (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_float using NEON intrinsics. More...
 
ne10_result_t ne10_mulc_float_asm (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_float using NEON assembly. More...
 
ne10_result_t ne10_mulc_vec2f_c (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec2f using plain C code. More...
 
ne10_result_t ne10_mulc_vec2f_neon (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec2f using NEON intrinsics. More...
 
ne10_result_t ne10_mulc_vec2f_asm (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec2f using NEON assembly. More...
 
ne10_result_t ne10_mulc_vec3f_c (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec3f using plain C code. More...
 
ne10_result_t ne10_mulc_vec3f_neon (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec3f using NEON intrinsics. More...
 
ne10_result_t ne10_mulc_vec3f_asm (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec3f using NEON assembly. More...
 
ne10_result_t ne10_mulc_vec4f_c (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec4f using plain C code. More...
 
ne10_result_t ne10_mulc_vec4f_neon (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec4f using NEON intrinsics. More...
 
ne10_result_t ne10_mulc_vec4f_asm (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Specific implementation of ne10_mulc_vec4f using NEON assembly. More...
 
ne10_result_t ne10_mul_float_c (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_mul_float using plain C code. More...
 
ne10_result_t ne10_mul_float_neon (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count) asm("ne10_mul_float_neon")
 Specific implementation of ne10_mul_float using NEON intrinsics. More...
 
ne10_result_t ne10_mul_float_asm (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_mul_float using NEON assembly. More...
 
ne10_result_t ne10_vmul_vec2f_c (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec2f using plain C code. More...
 
ne10_result_t ne10_vmul_vec2f_neon (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count) asm("ne10_vmul_vec2f_neon")
 Specific implementation of ne10_vmul_vec2f using NEON intrinsics. More...
 
ne10_result_t ne10_vmul_vec2f_asm (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec2f using NEON assembly. More...
 
ne10_result_t ne10_vmul_vec3f_c (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec3f using plain C code. More...
 
ne10_result_t ne10_vmul_vec3f_neon (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count) asm("ne10_vmul_vec3f_neon")
 Specific implementation of ne10_vmul_vec3f using NEON intrinsics. More...
 
ne10_result_t ne10_vmul_vec3f_asm (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec3f using NEON assembly. More...
 
ne10_result_t ne10_vmul_vec4f_c (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec4f using plain C code. More...
 
ne10_result_t ne10_vmul_vec4f_neon (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count) asm("ne10_vmul_vec4f_neon")
 Specific implementation of ne10_vmul_vec4f using NEON intrinsics. More...
 
ne10_result_t ne10_vmul_vec4f_asm (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Specific implementation of ne10_vmul_vec4f using NEON assembly. More...
 

Variables

ne10_result_t(* ne10_mulc_float )(ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)
 Multiplies all scalar elements of an input array by a constant value, storing the results in an output array. More...
 
ne10_result_t(* ne10_mulc_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)
 Element-wise multiplies all the 2D vectors in an input array by a constant vector, storing the results in an output array. More...
 
ne10_result_t(* ne10_mulc_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)
 Element-wise multiplies all the 3D vectors in an input array by a constant vector, storing the results in an output array. More...
 
ne10_result_t(* ne10_mulc_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)
 Element-wise multiplies all the 4D vectors in an input array by a constant vector, storing the results in an output array. More...
 
ne10_result_t(* ne10_mul_float )(ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)
 Multiplies the scalar elements of one input array by those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_vmul_vec2f )(ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)
 Element-wise multiplies the 2D vectors of one input array by those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_vmul_vec3f )(ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)
 Element-wise multiplies the 3D vectors of one input array by those of the same index in another, storing the results in an output array. More...
 
ne10_result_t(* ne10_vmul_vec4f )(ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)
 Element-wise multiplies the 4D vectors of one input array by those of the same index in another, storing the results in an output array. More...
 

Detailed Description

These functions implement vector element-wise multiplication operations for single precision floating point values.

Function Documentation

ne10_result_t ne10_mul_float_asm ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_mul_float using NEON assembly.

ne10_result_t ne10_mul_float_c ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_mul_float using plain C code.

Definition at line 37 of file NE10_mul.c.

ne10_result_t ne10_mul_float_neon ( ne10_float32_t dst,
ne10_float32_t src1,
ne10_float32_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_mul_float using NEON intrinsics.

ne10_result_t ne10_mulc_float_asm ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_float using NEON assembly.

ne10_result_t ne10_mulc_float_c ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_float using plain C code.

Definition at line 37 of file NE10_mulc.c.

ne10_result_t ne10_mulc_float_neon ( ne10_float32_t dst,
ne10_float32_t src,
const ne10_float32_t  cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_float using NEON intrinsics.

Definition at line 39 of file NE10_mulc.neon.c.

ne10_result_t ne10_mulc_vec2f_asm ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec2f using NEON assembly.

ne10_result_t ne10_mulc_vec2f_c ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec2f using plain C code.

Definition at line 47 of file NE10_mulc.c.

ne10_result_t ne10_mulc_vec2f_neon ( ne10_vec2f_t dst,
ne10_vec2f_t src,
const ne10_vec2f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec2f using NEON intrinsics.

Definition at line 49 of file NE10_mulc.neon.c.

ne10_result_t ne10_mulc_vec3f_asm ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec3f using NEON assembly.

ne10_result_t ne10_mulc_vec3f_c ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec3f using plain C code.

Definition at line 58 of file NE10_mulc.c.

ne10_result_t ne10_mulc_vec3f_neon ( ne10_vec3f_t dst,
ne10_vec3f_t src,
const ne10_vec3f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec3f using NEON intrinsics.

Definition at line 59 of file NE10_mulc.neon.c.

ne10_result_t ne10_mulc_vec4f_asm ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec4f using NEON assembly.

ne10_result_t ne10_mulc_vec4f_c ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec4f using plain C code.

Definition at line 70 of file NE10_mulc.c.

ne10_result_t ne10_mulc_vec4f_neon ( ne10_vec4f_t dst,
ne10_vec4f_t src,
const ne10_vec4f_t cst,
ne10_uint32_t  count 
)

Specific implementation of ne10_mulc_vec4f using NEON intrinsics.

Definition at line 73 of file NE10_mulc.neon.c.

ne10_result_t ne10_vmul_vec2f_asm ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec2f using NEON assembly.

ne10_result_t ne10_vmul_vec2f_c ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec2f using plain C code.

Definition at line 47 of file NE10_mul.c.

ne10_result_t ne10_vmul_vec2f_neon ( ne10_vec2f_t dst,
ne10_vec2f_t src1,
ne10_vec2f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec2f using NEON intrinsics.

ne10_result_t ne10_vmul_vec3f_asm ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec3f using NEON assembly.

ne10_result_t ne10_vmul_vec3f_c ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec3f using plain C code.

Definition at line 58 of file NE10_mul.c.

ne10_result_t ne10_vmul_vec3f_neon ( ne10_vec3f_t dst,
ne10_vec3f_t src1,
ne10_vec3f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec3f using NEON intrinsics.

ne10_result_t ne10_vmul_vec4f_asm ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec4f using NEON assembly.

ne10_result_t ne10_vmul_vec4f_c ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec4f using plain C code.

Definition at line 70 of file NE10_mul.c.

ne10_result_t ne10_vmul_vec4f_neon ( ne10_vec4f_t dst,
ne10_vec4f_t src1,
ne10_vec4f_t src2,
ne10_uint32_t  count 
)

Specific implementation of ne10_vmul_vec4f using NEON intrinsics.

Variable Documentation

ne10_result_t(* ne10_mul_float) (ne10_float32_t *dst, ne10_float32_t *src1, ne10_float32_t *src2, ne10_uint32_t count)

Multiplies the scalar elements of one input array by those of the same index in another, storing the results in an output array.

Points to ne10_mul_float_c, ne10_mul_float_neon, or ne10_mul_float_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of multiplications to be performed (i.e. the length of the source arrays)

Definition at line 250 of file NE10_init_math.c.

ne10_result_t(* ne10_mulc_float) (ne10_float32_t *dst, ne10_float32_t *src, const ne10_float32_t cst, ne10_uint32_t count)

Multiplies all scalar elements of an input array by a constant value, storing the results in an output array.

Points to ne10_mulc_float_c, ne10_mulc_float_neon, or ne10_mulc_float_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstThe constant to multiply by
[in]countThe number of scalar values to be processed

Definition at line 232 of file NE10_init_math.c.

ne10_result_t(* ne10_mulc_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src, const ne10_vec2f_t *cst, ne10_uint32_t count)

Element-wise multiplies all the 2D vectors in an input array by a constant vector, storing the results in an output array.

Points to ne10_mulc_vec2f_c, ne10_mulc_vec2f_neon, or ne10_mulc_vec2f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 2D vector to multiply by
[in]countThe number of 2D vectors to be processed

Definition at line 233 of file NE10_init_math.c.

ne10_result_t(* ne10_mulc_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src, const ne10_vec3f_t *cst, ne10_uint32_t count)

Element-wise multiplies all the 3D vectors in an input array by a constant vector, storing the results in an output array.

Points to ne10_mulc_vec3f_c, ne10_mulc_vec3f_neon, or ne10_mulc_vec3f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 3D vector to multiply by
[in]countThe number of 3D vectors to be processed

Definition at line 234 of file NE10_init_math.c.

ne10_result_t(* ne10_mulc_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src, const ne10_vec4f_t *cst, ne10_uint32_t count)

Element-wise multiplies all the 4D vectors in an input array by a constant vector, storing the results in an output array.

Points to ne10_mulc_vec4f_c, ne10_mulc_vec4f_neon, or ne10_mulc_vec4f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]srcPointer to the source array
[in]cstPointer to the constant 4D vector to multiply by
[in]countThe number of 4D vectors to be processed

Definition at line 235 of file NE10_init_math.c.

ne10_result_t(* ne10_vmul_vec2f) (ne10_vec2f_t *dst, ne10_vec2f_t *src1, ne10_vec2f_t *src2, ne10_uint32_t count)

Element-wise multiplies the 2D vectors of one input array by those of the same index in another, storing the results in an output array.

Points to ne10_vmul_vec2f_c, ne10_vmul_vec2f_neon, or ne10_vmul_vec2f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of multiplications to be performed (i.e. the length of the source arrays)

Definition at line 264 of file NE10_init_math.c.

ne10_result_t(* ne10_vmul_vec3f) (ne10_vec3f_t *dst, ne10_vec3f_t *src1, ne10_vec3f_t *src2, ne10_uint32_t count)

Element-wise multiplies the 3D vectors of one input array by those of the same index in another, storing the results in an output array.

Points to ne10_vmul_vec3f_c, ne10_vmul_vec3f_neon, or ne10_vmul_vec3f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of multiplications to be performed (i.e. the length of the source arrays)

Definition at line 265 of file NE10_init_math.c.

ne10_result_t(* ne10_vmul_vec4f) (ne10_vec4f_t *dst, ne10_vec4f_t *src1, ne10_vec4f_t *src2, ne10_uint32_t count)

Element-wise multiplies the 4D vectors of one input array by those of the same index in another, storing the results in an output array.

Points to ne10_vmul_vec4f_c, ne10_vmul_vec4f_neon, or ne10_vmul_vec4f_asm. This operation can be performed in-place.

Parameters
[out]dstPointer to the destination array
[in]src1Pointer to the first source array
[in]src2Pointer to the second source array
[in]countThe number of multiplications to be performed (i.e. the length of the source arrays)

Definition at line 266 of file NE10_init_math.c.