[GPU] Fix macro multiple register and micro kernel block size issues (#31651)
### Details:
- Fix the multiple register issue with the 'HEADS_PER_WI' macro
- Fix the micro kernel block size issue when computing aligned_seq_len
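
For context, aligning a sequence length to a block size usually means rounding it up to the next multiple of that block size. The sketch below is only an illustration of that general idea: the helper name `align_up` and its parameters are hypothetical and are not taken from this change, and the actual computation of aligned_seq_len in the GPU plugin may differ.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not from this commit): round `value` up to the
// nearest multiple of `block_size`. A value that is already a multiple
// is returned unchanged.
inline std::size_t align_up(std::size_t value, std::size_t block_size) {
    assert(block_size > 0);  // a zero block size would divide by zero
    return (value + block_size - 1) / block_size * block_size;
}
```

Under these assumptions, a sequence length of 1000 with a block size of 16 would be aligned to 1008, while 1024 would stay at 1024. Using a block size that does not match the one the micro kernel was built with would give a different aligned length.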
### Tickets:
- *CVS-171882*
Co-authored-by: Chen Peter <peter.chen@intel.com>