jax
CI: 12/10/24 upstream sync
#176
Merged

CI: 12/10/24 upstream sync #176

charleshofer merged 170 commits into rocm-main from main
charleshofer
dfm Use optimize='auto' for multi_dot.
236d4c60
google-labs-jules[bot] Add exec_time_optimization_effort and memory_fitting_effort flags.
762301fc
pearu Use next to tiny as smallest floating point value on Mac ARM
504c7387
jedborovik Add version check for effort flags
83b54d97
jedborovik Merge branch 'main' into add-optimization-effort-flags
c65ce4b0
houeland Fix jnp.matmul return shape documentation
cd578d97
Rifur13 Save residuals in the decode attention pallas kernel
a4e742d2
apaszke [Pallas:MGPU] Make the shapes from the attention example more interes…
8a316195
rajasekharporeddy Update the docstring of jax.lax.switch
bbc4a20c
apaszke [Pallas:MGPU] Fix a use-after-free in lowering
b1423a36
Google-ML-Automation Merge pull request #25191 from houeland:patch-1
2e0474a5
jakevdp jnp.reshape: raise TypeError when specifying newshape
a7039a27
Google-ML-Automation Merge pull request #25117 from pearu:pearu/arcsin-mac-arm
6b029503
justinjfu [Mosaic GPU] Automatically squash a >3D logical grid into a 3D physic…
784ebeab
Google-ML-Automation Merge pull request #25179 from rajasekharporeddy:lax_switch
385328b5
jakevdp Skip vecmat & matvec in NumPy tests.
f182aa8e
Google-ML-Automation Merge pull request #25055 from dfm:multi-dot
46c748b9
Google-ML-Automation [jax] Typing on common_devices_indices_map
c9a59022
Google-ML-Automation Update XLA dependency to use revision
9f203017
nitins17 Update Cloud TPU workflow with new build.py usage
0134fa83
gnecula [shape_poly] Remove obsolete part of the shape polymorphism documenta…
b3c405c2
Google-ML-Automation Merge pull request #25216 from gnecula:poly_doc
908865f2
emilyfertig Add an option to deactivate automatic cluster detection in jax.distri…
6a8bbcba
Google-ML-Automation Merge pull request #24964 from emilyfertig:emilyaf-deactivate-cluster…
c9c043cf
Google-ML-Automation Merge pull request #25210 from jakevdp:fix-nightly
f2f02eea
andportnoy [Mosaic GPU] Improve default kernel name and add option to customize
7bd81dbe
nireekshak Fix some typos
f43fa9fc
apaszke [Pallas:MGPU] Add tests for attention with non-trivial batch size
0bb68f6a
chr1sj0nes [jax] Improve naming of `DotAlgorithmPreset` properties and simplify …
abf8f430
chr1sj0nes [jax] Make `DotAlgorithmPreset.supported_output_types` a function of …
a54319ec
petebu Fix doc typo
4b6035ca
jakevdp Fix nightly numpy test
2afc65a1
nitins17 Fix missing quotes in local xla path
cc95327a
Google-ML-Automation Merge pull request #25231 from jakevdp:fix-nightly-names
8c66cba4
Google-ML-Automation Merge pull request #25162 from nireekshak:testbranch
dfa0dd70
gflegar Integrate Triton up to [9732c047](https://github.com/openai/triton/co…
c4d19ca8
Google-ML-Automation Reverts a54319ec1886ed920d50cacf10e147a743888464
73962b74
ayaka14732 [Pallas TPU] Enable test for `jnp.logical_not` because it's now suppo…
2dae81a8
Google-ML-Automation Merge pull request #24748 from jakevdp:reshape-dep
d990dcf2
bythew3i [Mosaic TPU] Support packed type matmul with arbitrary shapes.
9e5edb70
Google-ML-Automation Update XLA dependency to use revision
ceeed909
jakevdp Improve trace-time performance of jnp.isscalar
0140a98e
pschuh Add _raw_platform to work around extra platform normalization logic a…
fcf0b6d3
Google-ML-Automation Use JAX's default device instead of jax.devices()[0], if set.
fd4b1608
Google-ML-Automation Merge pull request #25237 from jakevdp:faster-isscalar
40122f7c
jakevdp Fix indexing corner case with empty ellipses
f6f4ef06
nitins17 Convert MSYS' Linux-like paths to Windows paths in JAX CI.
8c78c1e7
dimitar-asenov [Mosaic GPU] Add missing import.
cb2cf56e
cperivol [mgpu_pallas] Optionally pass default value instead of raising an err…
1ddba9b1
cperivol [mgpu_pallas] Allow loading scalars or indexing arrays from gmem usin…
3895e037
apaszke [Mosaic GPU] Add an optimization barrier
11090be0
hawkinsp Fix Windows portability problem in compilation cache test.
5a250097
Google-ML-Automation Merge pull request #25246 from hawkinsp:win
09177cf7
hawkinsp Disable backwards compatibility test for Triton IR.
2ac26924
superbobry [pallas:mosaic_gpu] `emit_pipeline` no longer ignores transforms
12b45b32
superbobry [pallas:mosaic_gpu] Use `jax.tree_util.register_dataclass` for transf…
46eb77be
hawkinsp Disable JaxAotTest.test_topology_pjit_serialize on GPU, which fails i…
bdadc53e
hawkinsp Disable pgle_test on non-GPU platforms.
681b9c2e
yashk2810 Fix the broken behavior of not resetting the abstract_mesh and device…
653f6545
justinjfu [Pallas] Update changelog for `pl.estimate_cost`
721b517e
Google-ML-Automation Merge pull request #25239 from jakevdp:indexing
1da03791
Google-ML-Automation Merge pull request #25006 from andportnoy:aportnoy/mosaic-gpu-kernel-…
fa6585de
jakevdp CI: update array-api-tests to latest commit
8563449a
yashk2810 [sharding_in_types] Use `set_mesh` API to trigger sharding_in_types i…
9e2708eb
Google-ML-Automation Merge pull request #25264 from jakevdp:update-array-api
222b2e75
Google-ML-Automation Merge pull request #25199 from Rifur13:save_residuals
db97d7aa
justinjfu [Pallas] Fix type annotation on TritonCompilerParams
1a3c9c44
yashk2810 Simply abstract_mesh and device_context context managers and handle e…
a735bf83
Remove obsolete deprecation
5ade371c
justinjfu [Pallas] Pallas documentation cleanup
e05afefc
naummo [Mosaic] Add extra memref_slice verification and a memory space check…
3990e05a
lorenrose1013 context manager methods for AbstractMesh to appease type checker.
208194f9
Google-ML-Automation Merge pull request #25114 from jedborovik:add-optimization-effort-flags
182e5326
Google-ML-Automation Update XLA dependency to use revision
28528d44
Google-ML-Automation More thorough propagation of host linear layout. Currently linear lay…
f160df04
tlongeri [Mosaic:TPU] Lift offset restrictions on single-row (1, 128) -> (8, 1…
10116874
mattjj remove vestigial ad.reducing_transposes table
6172a1f1
tlongeri [Mosaic:TPU] Add relayout for adding minor implicit dim and relax som…
8163e74e
Google-ML-Automation [AutoPGLE] Add multi-process test case
7214a3a8
gnecula [shape_poly] Fix the handling of __pow__ for symbolic dimensions
4e17bea9
apaszke [Mosaic GPU] Remove expect_wait from Barrier.wait
c965ffbf
superbobry [pallas:mosaic_gpu] Removed leftover debugging code
03861d43
Google-ML-Automation Merge pull request #25276 from mattjj:remove-vestigial-reducing-trans…
39d73a68
chr1sj0nes Reverts 73962b740890a728295fa09f515dcf96cb820822
569c2a3c
apaszke [Mosaic TPU] Add support for modeling loads/stores and fix minor issu…
d5ead570
apaszke [Mosaic GPU] Always annotate block initialization in the profiles
d034680f
superbobry [pallas:mosaic_gpu] Do not store the grid mapping in `ModuleContext`
e5102957
gnecula [shape_poly] Remove some deprecated kwargs
5fe5206b
superbobry [pallas:mosaic_gpu] Removed unnecessarily strict check in `emit_pipel…
4a41aa0a
gnecula [export] Removed __gpu$xla.gpu.triton (Pallas GPU) from the list of c…
3f5f3e1c
jakevdp jax.numpy: require boolean dtype for where argument
29a8cce6
jakevdp jnp.linalg.vector_norm: properly support multiple axes
aaaee63a
hyeontaek [JAX] Add end-to-end execution support in colocated Python API
e20a483b
Google-ML-Automation Merge pull request #25271 from jakevdp:fix-vector-norm
a71f9a62
Google-ML-Automation Merge pull request #25290 from jakevdp:reduction-where
f73fa7a7
bixia1 [jax:custom_partitioning] Implement SdyShardingRule to support
2a4a0e8d
tlongeri [Mosaic:TPU] Fix fully replicated relayout
23d5c10f
jakevdp array API: improve test coverage
d88ef23a
hawkinsp JAX release 0.4.36.
7e6620a5
justinjfu [Pallas] Fix shard_axis in dma_start interpret mode rule.
259194a6
justinjfu [Pallas] Fix shard_axis in dma_start interpret mode rule.
fd42b561
Google-ML-Automation Merge pull request #25269 from justinjfu:pallas_docs_cleanup
d782b246
tlongeri [Mosaic:TPU] Fix elementwise inference with i1s
651ab188
Google-ML-Automation [pallas] fix jumble test flakiness
84f3f992
hawkinsp Merge branch 'release/0.4.36' after release
ab02bf87
Google-ML-Automation Update XLA dependency to use revision
1ca8903a
Google-ML-Automation Merge pull request #25296 from hawkinsp:postrelease
45159a2f
Google-ML-Automation Merge pull request #25252 from gnecula:poly_power
9fc077a5
hawkinsp Bump JAX version after release.
ba626fa6
Activate Schur Decomposition to XLA's FFI
9081e85d
hawkinsp Set -Werror=mismatched-tags on clang.
fac1b1a7
superbobry [pallas:mosaic_gpu] `FragmentedArray.reduce_sum` now returns a `Fragm…
bae66000
superbobry [mosaic_gpu] Emit a slightly more informative error message in `Fragm…
08d31d0f
jburnim Fix error when swapping a ref with a trivial indexing transform.
af501356
apaszke [Pallas MGPU] Use multiple k/v_consumed_barriers in the attention kernel
8b656206
apaszke [Pallas MGPU] Disable XLA:GPU autotuning in attention tests
eda7506d
charleshofer Don't look for CUDA files when building the ROCm wheel
0c6b967e
Google-ML-Automation Merge pull request #25205 from jburnim:jburnim_swap_fix
72df8e0c
justinjfu [Pallas] Add support for run_state to cost estimator.
641a1d53
nouiz Document cudaMallocAsync as an experimental feature.
a13b618c
Google-ML-Automation Merge pull request #25082 from nouiz:doc_cuda_malloc_async
b6499e21
justinjfu [Pallas] Update TPU documentation
2b2d7cda
nitins17 Add a flag to enable detailed timestamped logging of subprocess comma…
83c64b23
Google-ML-Automation Update XLA dependency to use revision
baedb62b
danijar Support transfer guard in broadcast_one_to_all(). Fixes https://githu…
861115ad
dfm Temporarily allow bfloat16 dot algorithms on CPU.
1f4d184a
Google-ML-Automation Update XLA dependency to use revision
ad00ee1e
gnecula [export] Improved the documentation.
cc73c50c
Google-ML-Automation Update XLA dependency to use revision
70623255
carlosgmartin Fix type annotation for numpy.linalg.matrix_norm argument 'ord'.
efa35ea9
chr1sj0nes [pallas:triton] Add support for `DotAlgorithmPreset` `precision` argu…
3ec55c77
chr1sj0nes [pallas] Add `DotAlgorithmPreset` note to CHANGELOG.
a94474d0
apaszke [Mosaic TPU] Allow downgrading the IR during serialization for forwar…
adb2bf62
Activate Tridiagonal Reduction to XLA's FFI
d474feda
Google-ML-Automation Merge pull request #25338 from carlosgmartin:fix_numpy_linalg_matrix_…
5a1c4c57
superbobry Ensured that JAX type checks under pytype on Python 3.12
1ac6b762
Google-ML-Automation Merge pull request #25320 from ROCm:gh-9948-fix-kernel-build-upstream
cc258f5f
hawkinsp Remove dead code after minimum jaxlib version bump to v0.4.36.
79318a08
ayaka14732 [Pallas TPU] Improve lowerings for boolean comparison operations
9c98c0cb
dougalm Use private names for args in api_util to avoid shadowing kwargs keys.
dd74394e
tlongeri [Mosaic:TPU][NFC] In ext and trunc rules, avoid vreg array reshape by…
b76d264f
berkinilbeyi Reenable for_loop_test on TPU v5p.
f17b2bc2
bartchr808 #sdy enable `test_compute_offload_mesh_with_linear_layout` for Shardy.
6f69774c
dependabot[bot] Bump actions/cache from 4.1.2 to 4.2.0
b6863dfc
Google-ML-Automation Merge pull request #25349 from jax-ml:fix-25329
d908e0ad
hyeontaek [JAX] Add concurrent execution support in colocated Python
296d1670
IvyZX Avoid index out of range error in carry structure check
bd77a703
dfm Add error message when using custom_vmap with reverse-mode AD, and ad…
092d2a0d
Google-ML-Automation Merge pull request #25351 from jax-ml:dependabot/github_actions/actio…
56fcd38d
Google-ML-Automation Merge pull request #25294 from jakevdp:array-api-tests
dba3358d
Google-ML-Automation Merge pull request #25355 from IvyZX:loop-fix
4db533be
Google-ML-Automation Merge pull request #25272 from justinjfu:pallas_tpu_docs_update
1c07ec61
kanglant Disable pjit ArrayPjitTest.test_device_put_grad test on TPU v5e
66b90054
IvyZX Avoid index out of range error in carry structure check
65b60884
dougalm Use private names for args in api_util to avoid shadowing kwargs keys.
95892fda
hawkinsp Update versions for v0.4.37 release.
ffb07cda
dfm Port symmetric tridiagonal reduction GPU kernel to FFI.
32df37e6
Google-ML-Automation Update XLA dependency to use revision
71c48cba
dfm Fix expected exception type in pallas grad tests.
978d35f6
hawkinsp Merge branch 'release/0.4.37' into main.
820f51dc
Google-ML-Automation Merge pull request #25368 from hawkinsp:postrelease
7b5cb56f
dougalm Add a `freeze` primitive to delimit ref lifetimes for AD.
fc2edbfa
jakevdp CI: temporarily pin numpy version for mypy check
a36af966
Google-ML-Automation Merge pull request #25371 from jakevdp:mypy-numpy-version
1743f2c4
yashk2810 Add a no-op batching rule for optimization_barrier_p
944d822c
ghpvnist Introduce `lax.ragged_all_to_all` primitive
12c30578
Google-ML-Automation Merge pull request #25335 from gnecula:export_doc_call
90de28cd
traversaro Update conda-forge installation docs after CUDA 12 upgrade
09309e64
Google-ML-Automation Merge pull request #25374 from traversaro:patch-1
8e7aaa79
Google-ML-Automation Merge pull request #25369 from jax-ml:mutable-arrays-ad
263d4d14
charleshofer charleshofer enabled auto-merge 1 year ago
disabled auto-merge 1 year ago
Manually disabled by user
charleshofer charleshofer merged f04f164a into rocm-main 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone