[CUDA][TEST] Implement UR_KERNEL_SUB_GROUP_INFO_COMPILE_NUM_SUB_GROUPS and enable tests
- Calculate compile_num_sub_groups for a kernel from its reqd_work_group_size and reqd_sub_group_size metadata
- Remove UUR_KNOWN_FAILURE_ON for CUDA from CompileWorkGroupSize and CompileNumSubGroups tests
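
For reviewers, a minimal sketch of how such a calculation could look. This is not the adapter code from this change: the helper name `compileNumSubGroups`, the use of 0 to mean "metadata absent", and the round-up division are all assumptions made for illustration.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not the adapter's actual function).
// reqdWorkGroupSize: the three reqd_work_group_size values, assumed to be
//                    all 0 when the kernel carries no such metadata.
// reqdSubGroupSize:  the reqd_sub_group_size value, assumed 0 when absent.
// Returns the number of sub-groups needed to cover one work-group,
// or 0 when either piece of metadata is missing.
constexpr std::uint32_t
compileNumSubGroups(const std::array<std::uint32_t, 3> &reqdWorkGroupSize,
                    std::uint32_t reqdSubGroupSize) {
  // Total work-items in one work-group, widened to 64 bits before multiplying.
  const std::uint64_t workGroupItems = std::uint64_t{reqdWorkGroupSize[0]} *
                                       reqdWorkGroupSize[1] *
                                       reqdWorkGroupSize[2];
  if (workGroupItems == 0 || reqdSubGroupSize == 0) {
    return 0; // Nothing was fixed at compile time.
  }
  // Round up so a partially filled trailing sub-group is still counted.
  return static_cast<std::uint32_t>(
      (workGroupItems + reqdSubGroupSize - 1) / reqdSubGroupSize);
}

// Spot checks for the sketch above.
inline void checkCompileNumSubGroups() {
  assert(compileNumSubGroups({64, 1, 1}, 32) == 2); // 64 items, two full sub-groups
  assert(compileNumSubGroups({8, 4, 1}, 32) == 1);  // 32 items, one sub-group
  assert(compileNumSubGroups({33, 1, 1}, 32) == 2); // partial sub-group rounds up
  assert(compileNumSubGroups({0, 0, 0}, 32) == 0);  // no work-group size metadata
  assert(compileNumSubGroups({64, 1, 1}, 0) == 0);  // no sub-group size metadata
}
```

Whether a missing attribute should yield 0 or some other value depends on the Unified Runtime specification for this query; the sketch only illustrates the arithmetic.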