PLASMA
2.4.5
PLASMA - Parallel Linear Algebra for Scalable Multi-core Architectures
|
Go to the source code of this file.
Macros | |
#define | REAL |
Functions | |
int | CORE_dtsmqr_corner (int m1, int n1, int m2, int n2, int m3, int n3, int k, int ib, int nb, double *A1, int lda1, double *A2, int lda2, double *A3, int lda3, double *V, int ldv, double *T, int ldt, double *WORK, int ldwork) |
void | QUARK_CORE_dtsmqr_corner (Quark *quark, Quark_Task_Flags *task_flags, int m1, int n1, int m2, int n2, int m3, int n3, int k, int ib, int nb, double *A1, int lda1, double *A2, int lda2, double *A3, int lda3, double *V, int ldv, double *T, int ldt) |
void | CORE_dtsmqr_corner_quark (Quark *quark) |
PLASMA core_blas kernel PLASMA is a software package provided by Univ. of Tennessee, Univ. of California Berkeley and Univ. of Colorado Denver
Definition in file core_dtsmqr_corner.c.
#define REAL |
Definition at line 20 of file core_dtsmqr_corner.c.
int CORE_dtsmqr_corner | ( | int | m1, |
int | n1, | ||
int | m2, | ||
int | n2, | ||
int | m3, | ||
int | n3, | ||
int | k, | ||
int | ib, | ||
int | nb, | ||
double * | A1, | ||
int | lda1, | ||
double * | A2, | ||
int | lda2, | ||
double * | A3, | ||
int | lda3, | ||
double * | V, | ||
int | ldv, | ||
double * | T, | ||
int | ldt, | ||
double * | WORK, | ||
int | ldwork | ||
) |
CORE_dtsmqr_corner: see CORE_dtsmqr
This kernel applies left and right transformations as depicted below: |I -VT'V'| * | A1 A2'| * |I - VTV'| | A2 A3 | where A1 and A3 are symmetric matrices. Only the lower part is referenced. This is an adhoc implementation, can be further optimized...
[in] | side |
|
[in] | trans |
|
[in] | M1 | The number of rows of the tile A1. M1 >= 0. |
[in] | N1 | The number of columns of the tile A1. N1 >= 0. |
[in] | M2 | The number of rows of the tile A2. M2 >= 0. M2 = M1 if side == PlasmaRight. |
[in] | N2 | The number of columns of the tile A2. N2 >= 0. N2 = N1 if side == PlasmaLeft. |
[in] | K | The number of elementary reflectors whose product defines the matrix Q. |
[in] | IB | The inner-blocking size. IB >= 0. |
[in,out] | A1 | On entry, the M1-by-N1 tile A1. On exit, A1 is overwritten by the application of Q. |
[in] | LDA1 | The leading dimension of the array A1. LDA1 >= max(1,M1). |
[in,out] | A2 | On entry, the M2-by-N2 tile A2. On exit, A2 is overwritten by the application of Q. |
[in] | LDA2 | The leading dimension of the tile A2. LDA2 >= max(1,M2). |
[in] | V | The i-th row must contain the vector which defines the elementary reflector H(i), for i = 1,2,...,k, as returned by CORE_DTSQRT in the first k columns of its array argument V. |
[in] | LDV | The leading dimension of the array V. LDV >= max(1,K). |
[out] | T | The IB-by-N1 triangular factor T of the block reflector. T is upper triangular by block (economic storage); The rest of the array is not referenced. |
[in] | LDT | The leading dimension of the array T. LDT >= IB. |
[out] | WORK | Workspace array of size LDWORK-by-N1 if side == PlasmaLeft LDWORK-by-IB if side == PlasmaRight |
[in] | LDWORK | The leading dimension of the array WORK. LDWORK >= max(1,IB) if side == PlasmaLeft LDWORK >= max(1,M1) if side == PlasmaRight |
PLASMA_SUCCESS | successful exit |
<0 | if -i, the i-th argument had an illegal value |
Definition at line 125 of file core_dtsmqr_corner.c.
References CORE_dtsmqr(), coreblas_error, PLASMA_SUCCESS, PlasmaLeft, PlasmaNoTrans, PlasmaRight, PlasmaTrans, side, and trans.
void CORE_dtsmqr_corner_quark | ( | Quark * | quark | ) |
Definition at line 254 of file core_dtsmqr_corner.c.
References CORE_dtsmqr_corner(), quark_unpack_args_21, T, and V.
void QUARK_CORE_dtsmqr_corner | ( | Quark * | quark, |
Quark_Task_Flags * | task_flags, | ||
int | m1, | ||
int | n1, | ||
int | m2, | ||
int | n2, | ||
int | m3, | ||
int | n3, | ||
int | k, | ||
int | ib, | ||
int | nb, | ||
double * | A1, | ||
int | lda1, | ||
double * | A2, | ||
int | lda2, | ||
double * | A3, | ||
int | lda3, | ||
double * | V, | ||
int | ldv, | ||
double * | T, | ||
int | ldt | ||
) |
Definition at line 214 of file core_dtsmqr_corner.c.
References CORE_dtsmqr_corner_quark(), INOUT, INPUT, QUARK_Insert_Task(), QUARK_REGION_D, QUARK_REGION_L, SCRATCH, and VALUE.