implement predict command with shared prediction helpers

This is the largest command. It covers parsing of JSON and individual
inputs, base64 file encoding, output schema inspection, file output
handling, GPU retry logic, and signal handling.

Helper functions (runPrediction, predictJSONInputs, etc.) are shared
with the train command. useReplToken is passed as a parameter rather
than kept in a package-level variable.