|
1 | 1 | OpenBLAS ChangeLog |
| 2 | +==================================================================== |
| 3 | +Version 0.2.15 |
| 4 | +27-Oct-2015 |
| 5 | +common: |
| 6 | + * Support cmake on x86/x86-64. Natively compiling on MS Visual Studio. |
| 7 | + (experimental. Thank Hank Anderson for the initial cmake porting work.) |
| 8 | + |
| 9 | + On Linux and Mac OSX, OpenBLAS cmake supports assembly kernels. |
| 10 | + e.g. cmake . |
| 11 | + make |
| 12 | + make test (Optional) |
| 13 | + |
| 14 | + On Windows MS Visual Studio, OpenBLAS cmake only support C kernels. |
| 15 | + (OpenBLAS uses AT&T style assembly, which is not supported by MSVC.) |
| 16 | + e.g. cmake -G "Visual Studio 12 Win64" . |
| 17 | + Open OpenBLAS.sln and build. |
| 18 | + |
| 19 | + * Enable MAX_STACK_ALLOC flags by default. |
| 20 | + Improve ger and gemv for small matrices. |
| 21 | + * Improve gemv parallel with small m and large n case. |
| 22 | + * Improve ?imatcopy when lda==ldb (#633. Thanks, Martin Koehler) |
| 23 | + * Add vecLib benchmarks (#565. Thanks, Andreas Noack.) |
| 24 | + * Fix LAPACK lantr for row major matrices (#634. Thanks, Dan Kortschak) |
| 25 | + * Fix LAPACKE lansy (#640. Thanks, Dan Kortschak) |
| 26 | + * Import bug fixes for LAPACKE s/dormlq, c/zunmlq |
| 27 | + * Raise the signal when pthread_create fails (#668. Thanks, James K. Lowden) |
| 28 | + * Remove g77 from compiler list. |
| 29 | + * Enable AppVeyor Windows CI. |
| 30 | + |
| 31 | +x86/x86-64: |
| 32 | + * Support pure C generic kernels for x86/x86-64. |
| 33 | + * Support Intel Boardwell and Skylake by Haswell kernels. |
| 34 | + * Support AMD Excavator by Steamroller kernels. |
| 35 | + * Optimize s/d/c/zdot for Intel SandyBridge and Haswell. |
| 36 | + * Optimize s/d/c/zdot for AMD Piledriver and Steamroller. |
| 37 | + * Optimize s/d/c/zapxy for Intel SandyBridge and Haswell. |
| 38 | + * Optimize s/d/c/zapxy for AMD Piledriver and Steamroller. |
| 39 | + * Optimize d/c/zscal for Intel Haswell, dscal for Intel SandyBridge. |
| 40 | + * Optimize d/c/zscal for AMD Bulldozer, Piledriver and Steamroller. |
| 41 | + * Optimize s/dger for Intel SandyBridge. |
| 42 | + * Optimize s/dsymv for Intel SandyBridge. |
| 43 | + * Optimize ssymv for Intel Haswell. |
| 44 | + * Optimize dgemv for Intel Nehalem and Haswell. |
| 45 | + * Optimize dtrmm for Intel Haswell. |
| 46 | + |
| 47 | +ARM: |
| 48 | + * Support Android NDK armeabi-v7a-hard ABI (-mfloat-abi=hard) |
| 49 | + e.g. make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7 |
| 50 | + * Fix lock, rpcc bugs (#616, #617. Thanks, Grazvydas Ignotas) |
| 51 | +POWER: |
| 52 | + * Support ppc64le platform (ELF ABI v2. #612. Thanks, Matthew Brandyberry.) |
| 53 | + * Support POWER7/8 by POWER6 kernels. (#612. Thanks, Fábio Perez.) |
| 54 | + |
2 | 55 | ==================================================================== |
3 | 56 | Version 0.2.14 |
4 | 57 | 24-Mar-2015 |
|
0 commit comments