Checks for None tensors and skip them when splitting the buckets in zero stage 2. #728
add flops count profiler
86a33ca5
add run script for the small test model
eedb939c
fix flops sum and add batch counter
0b0b7270
remove hook handles
558ad138
add conv2d flops
22ae7520
add flops compute for major functionals and rnn modules
b093b716
work
5ce1d1fa
fix flops count cal in post hook
2575dd39
add flops of embedding and dropout as 0
f5d18923
fix
d4d332e1
add duration and throughput
b3914c7f
add time and throughput
dbc24c85
add basic tracer for wall clock time breakdown
3ad7bc12
Added top module info summary
e8bb9f48
refactor
e67c6504
refactor and add readme
4c9db1af
reorg folders
01db9435
update readme
f91b552a
fix xsp init import
964c689b
update readme and rename batch to step
68b5260a
fix multiple steps calc
9f38c8db
update readme
83ba5edb
rename pytorch-profiler to flops-profiler
9bd30d3a
rename pytorch-profiler to flops-profiler
a061bf37
fix steps calc and update readme
c6a6a3f7
update readme
9c4f3ad1
update ds
10aee869
remove tracer code
aecccaf5
fix incorrect merging
c6dfa070
fix formatting
b48ed309
fix
ab576677
fix sort value divided by steps
6e5ad79f
print info when steps is 0
03a521cb
Merge branch 'master' into cheng/flops_profiler
9b1a7b62
add flops profiler class and unit test
8ad406b6
delete examples
99e97e1c
add reload functionals in end profile
8403aaff
use config dict for flops profiler config
d1aaaf96
add doc strings and update readme
b26f2fbe
Merge branch 'master' into cheng/flops_profiler
cb2dfa0f
Merge branch 'master' into cheng/flops_profiler
78c4cc37
Merge branch 'master' into cheng/flops_profiler
0a1b04f5
Merge remote-tracking branch 'upstream/master' into master
5019a6eb
Merge remote-tracking branch 'upstream/master'
859b837a
Merge remote-tracking branch 'upstream/master' into master
625df299
check none tensors when splitting buckets
31830847
cli99
marked this pull request as draft 5 years ago
Merge branch 'master' into fix_zero2_none_tensor
246d53bb
cli99
marked this pull request as ready for review 5 years ago
samyam
approved these changes
on 2021-02-16
Merge branch 'master' into fix_zero2_none_tensor
dbbbb7f5
cli99
merged
7cab55c7
into master 5 years ago
cli99
deleted the fix_zero2_none_tensor branch 5 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub