|
PLASMA
2.4.5
PLASMA - Parallel Linear Algebra for Scalable Multi-core Architectures
|

Go to the source code of this file.
Macros | |
| #define | COMPLEX |
Functions | |
| int | CORE_cherfb (PLASMA_enum uplo, int n, int k, int ib, int nb, PLASMA_Complex32_t *A, int lda, PLASMA_Complex32_t *T, int ldt, PLASMA_Complex32_t *C, int ldc, PLASMA_Complex32_t *WORK, int ldwork) |
| void | QUARK_CORE_cherfb (Quark *quark, Quark_Task_Flags *task_flags, PLASMA_enum uplo, int n, int k, int ib, int nb, PLASMA_Complex32_t *A, int lda, PLASMA_Complex32_t *T, int ldt, PLASMA_Complex32_t *C, int ldc) |
| void | CORE_cherfb_quark (Quark *quark) |
PLASMA core_blas kernel PLASMA is a software package provided by Univ. of Tennessee, Univ. of California Berkeley and Univ. of Colorado Denver
Definition in file core_cherfb.c.
| #define COMPLEX |
Definition at line 18 of file core_cherfb.c.
| int CORE_cherfb | ( | PLASMA_enum | uplo, |
| int | n, | ||
| int | k, | ||
| int | ib, | ||
| int | nb, | ||
| PLASMA_Complex32_t * | A, | ||
| int | lda, | ||
| PLASMA_Complex32_t * | T, | ||
| int | ldt, | ||
| PLASMA_Complex32_t * | C, | ||
| int | ldc, | ||
| PLASMA_Complex32_t * | WORK, | ||
| int | ldwork | ||
| ) |
CORE_cherfb overwrites the symmetric complex N-by-N tile C with
Q**T*C*Q
where Q is a complex unitary matrix defined as the product of k elementary reflectors
Q = H(1) H(2) . . . H(k)
as returned by CORE_cgeqrt. Only PlasmaLower supported!
| [in] | uplo |
|
| [in] | n | The number of rows/columns of the tile C. N >= 0. |
| [in] | k | The number of elementary reflectors whose product defines the matrix Q. K >= 0. |
| [in] | ib | The inner-blocking size. IB >= 0. |
| [in] | nb | The blocking size. NB >= 0. |
| [in] | A | The i-th column must contain the vector which defines the elementary reflector H(i), for i = 1,2,...,k, as returned by CORE_cgeqrt in the first k columns of its array argument A. |
| [in] | lda | The leading dimension of the array A. LDA >= max(1,N). |
| [out] | T | The IB-by-K triangular factor T of the block reflector. T is upper triangular by block (economic storage); The rest of the array is not referenced. |
| [in] | ldt | The leading dimension of the array T. LDT >= IB. |
| [in,out] | C | On entry, the symmetric N-by-N tile C. On exit, C is overwritten by Q**T*C*Q. |
| [in] | ldc | The leading dimension of the array C. LDC >= max(1,M). |
| [in,out] | WORK | On exit, if INFO = 0, WORK(1) returns the optimal LDWORK. |
| [in] | ldwork | The dimension of the array WORK. LDWORK >= max(1,N); |
| PLASMA_SUCCESS | successful exit |
| <0 | if -i, the i-th argument had an illegal value |
Definition at line 110 of file core_cherfb.c.
References CORE_cunmlq(), CORE_cunmqr(), PlasmaConjTrans, PlasmaLeft, PlasmaLower, PlasmaNoTrans, and PlasmaRight.


| void CORE_cherfb_quark | ( | Quark * | quark | ) |
Definition at line 215 of file core_cherfb.c.
References A, C, CORE_cherfb(), quark_unpack_args_13, T, and uplo.


| void QUARK_CORE_cherfb | ( | Quark * | quark, |
| Quark_Task_Flags * | task_flags, | ||
| PLASMA_enum | uplo, | ||
| int | n, | ||
| int | k, | ||
| int | ib, | ||
| int | nb, | ||
| PLASMA_Complex32_t * | A, | ||
| int | lda, | ||
| PLASMA_Complex32_t * | T, | ||
| int | ldt, | ||
| PLASMA_Complex32_t * | C, | ||
| int | ldc | ||
| ) |
This kernel is just a workaround for now... will be deleted eventually and replaced by the one above (Piotr's Task)
Definition at line 183 of file core_cherfb.c.
References CORE_cherfb_quark(), INOUT, INPUT, PlasmaUpper, QUARK_Insert_Task(), QUARK_REGION_D, QUARK_REGION_L, QUARK_REGION_U, SCRATCH, and VALUE.

