This is an old revision of the document!
John can crack crypt SHA-256 on OpenCL enabled devices. To use it, type: 
john –format=sha256crypt-opencl [other options]
All available GPU power is used while John is running, so the computer can become less responsive, especially if the GPU is used to control your monitor.
The hint is: if your computer seems to hang and you have only one GPU, your X Server is busy and you can't do anything. Just wait 5 or 10 minutes. If nothing happens, reboot your computer. I saw cases on Radeon HD 6770 that after some minutes waiting everything comes to normality.
Despite of that, you can do real crack on your GPU, even if you use it as your graphics controller. Although, you will not be able to use your computer meanwhile.
The maximum password lenght is: 24 bytes.
The maximum salt lenght is: 16 bytes.
In order to change it, anyone can open the opencl_cryptsha256.h, modify the statements seen below and recompile the project. Note that some memory mis-alignment could lead to wrong computation results.
#define SALT_SIZE 16 #define PLAINTEXT_LENGTH 24
John can try to figure out what is the best configuration to use in the running hardware. In this set up mode, John will make some benchmark and take some measures. When finished, it will analyse the results and select the best configuration. Although it is not deterministic, the results (at least) give good hints about the best configuration.
There are 3 environment variables that could be set to configure John's behavior.
To access the auto-configuration mode, the user should set LWS and/or GWS to zero (LWS=0 and/or GWS=0).
Usage:
LWS=0 GWS=5120 ./john -fo:sha256crypt-opencl -t LWS=0 GWS=0 STEP=256 ./john -fo:sha256crypt-opencl -t LWS=32 GWS=0 STEP=1024 ./john -fo:sha256crypt-opencl -t LWS=32 GWS=0 STEP=0 ./john -fo:sha256crypt-opencl -t
One should:
LWS=32 GWS=512 ./john -fo:sha256crypt-opencl -t
My default first try on new hardware is:
LWS=32 GWS=0 STEP=512 ./john -fo:sha256crypt-opencl -t
===== Configuration =====
==== Tested Hardware ====
ID : processor, memory
    * C-01: AMD Phenom™ II X6 1075T Processor × 6, 4GB DDR3 1800MHz
    * C-02: AMD FX™-8120 Eight-Core Processor
    * C-03: AMD Radeon HD 6770 (Juniper)
    * C-04: AMD Radeon HD 7970 (Tahiti)
    * C-05: GeForce GTX 570
==== Tested Software ====
  * S-01:
    * SDK 2.6 OpenCL 1.1 AMD-APP (898.1)
    * Catalyst 12.2
    * Driver 8.95-120214a-134397C-ATI
    * Ubuntu 11.10 x86_64
  * S-02:
    * SDK 2.6 OpenCL 1.2 AMD-APP (923.1)
    * Catalyst 12.2
    * Driver 8.95-120214a-134397C-ATI
    * Ubuntu 12.04 x86_64
==== Benchmarks ====
ID : configuration. Operating system.
   * C-01 : Local work size (LWS) 1, global work size (GWS) 768. S-02.
     * Raw: 1884 c/s real, 315 c/s virtual 
   * C-02 : Local work size (LWS) 1, global work size (GWS) 1024.
     * Raw: 1697 c/s real, 212 c/s virtual
   * C-03. S-02.
     * Local work size (LWS) 185, global work size (GWS) 11100
     * Benchmarking: sha256crypt (rounds=5000) [OpenCL]… DONE
     * Raw:	10277 c/s real, 1110K c/s virtual
   * C-04.
     * Local work size (LWS) 128, global work size (GWS) 81920
     * Benchmarking: sha256crypt (rounds=5000) [OpenCL]… DONE
     * Raw:	50881 c/s real, 2048K c/s virtual
   * C-05.
     * Local work size (LWS) 256, global work size (GWS) 23040
     * Benchmarking: sha256crypt (rounds=5000) [OpenCL]… DONE
     * Raw:	19361 c/s real, 19443 c/s virtual
Reference (july 2012) 
⇒ John Jumbo on CPU C-01 (1 core) 
Benchmarking: sha256crypt (rounds=5000) [32/64]… DONE 
Raw:	367 c/s real, 367 c/s virtual 
⇒ John Jumbo with OMP on CPU C-01 (6 cores) 
Benchmarking: sha256crypt (rounds=5000) [32/64]… (6xOMP) DONE 
Raw:	1957 c/s real, 329 c/s virtual 
⇒ John Jumbo crypt(3) with OMP on CPU C-01 (6 cores) 
Benchmarking: generic crypt(3) SHA-256 rounds=5000 [?/64]… (6xOMP) DONE 
Many salts:	1333 c/s real, 227 c/s virtual 
Only one salt:	1321 c/s real, 227 c/s virtual 
⇒ John Jumbo on CPU C-02 (1 core) 
Benchmarking: sha256crypt (rounds=5000) [32/64]… DONE 
Raw:	300 c/s real, 298 c/s virtual 
⇒ John Jumbo with OMP on CPU C-02 (8 cores) 
Benchmarking: sha256crypt (rounds=5000) [32/64]… (8xOMP) DONE 
Raw:	1761 c/s real, 220 c/s virtual
⇒ John Jumbo crypt(3) with OMP on CPU C-02 (8 cores) 
Benchmarking: generic crypt(3) SHA-256 rounds=5000 [?/64]… (8xOMP) DONE 
Many salts:	1333 c/s real, 166 c/s virtual 
Only one salt:	1333 c/s real, 166 c/s virtual 
===== External Links =====
[1] AMD Accelerated Parallel Processing Programming Guide, visited on 04/19/2012
http://developer.amd.com/sdks/amdappsdk/assets/AMD_Accelerated_Parallel_Processing_OpenCL_Programming_Guide.pdf.