Chunxiang Xu avadesian
Loading Heatmap…

avadesian synced commits to staging/0.11.2rc1 at avadesian/skypilot from mirror

  • e67e8bfc05 [Core] Always set the SSH key permission (#8316) * Always set the SSH key permission * revert changes in templates * ensure the dir permission as well * safe operations

7 hours ago

avadesian synced commits to ssh-interactive-auth at avadesian/skypilot from mirror

7 hours ago

avadesian synced commits to observability at avadesian/skypilot from mirror

7 hours ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • abf7c92818 [Core] Always set the SSH key permission (#8316) * Always set the SSH key permission * revert changes in templates * ensure the dir permission as well * safe operations

7 hours ago

avadesian synced commits to releases/0.11.1 at avadesian/skypilot from mirror

  • a7380a514f Release 0.11.1 (#8310) Co-authored-by: GitHub Action <action@github.com>

13 hours ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • 54ee820143 [docs] Add NVIDIA Dynamo serving example (#7333) * nvidia-dynamo examples * Trigger CI * update automatic example generation * format * Update examples/serve/nvidia-dynamo/README.md Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> * Update examples/serve/nvidia-dynamo/README.md Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> * Update examples/serve/nvidia-dynamo/README.md Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> * Update examples/serve/nvidia-dynamo/README.md Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> * address comments * Update examples/serve/nvidia-dynamo/README.md Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> * address comments * updates * Add image * update banner * update what's next --------- Co-authored-by: Romil Bhardwaj <romil.bhardwaj@gmail.com> Co-authored-by: Romil Bhardwaj <romil.bhardwaj@berkeley.edu>
  • 8572b31924 Fixed plugin load in metrics process (#8318) Signed-off-by: Aylei <rayingecho@gmail.com>
  • 82a0ea1051 [Template] Add `--address` for list nodes to avoid warning for multiple ray cluster and fix a race in ray template (#8306) * Add --address for list nodes to avoid warning for multiple ray cluster * Add debug sleep * Use local ray address for worker for the initial check * remove debug sleep
  • 1aa2398db3 [k8s] Update the instruction for dealing with exec-based kubeconfig (#8210) * [k8s] Update the instruction for dealing with exec-based kubeconfig * update * update * fix comment
  • 3161f0d2b3 [Docs] Add Tip to Restart API Server if Credential Setup Fails (#8314) * Add tip. * Update docs/source/getting-started/installation.rst Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
  • Compare 5 commits »

13 hours ago

avadesian synced commits to lloyd/multiple-jobs-per-worker-p2 at avadesian/skypilot from mirror

13 hours ago

avadesian synced commits to consolidation-docs at avadesian/skypilot from mirror

13 hours ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • 456c6aa828 [Core] Remove cd SKY_REMOTE_WORKDIR step before submitting jobs (#7760)

1 day ago

avadesian synced commits to staging/0.11.0.post1 at avadesian/skypilot from mirror

  • 4584c34911 Cherry-pick: more logs about request operations #8270 #8271 (#8309) * API server: add PID and milliseconds ts in log line (#8270) Signed-off-by: Aylei <rayingecho@gmail.com> * Debug log trace for request write operations (#8271) * Debug log trace for request write operations Signed-off-by: Aylei <rayingecho@gmail.com> * Fix UT Signed-off-by: Aylei <rayingecho@gmail.com> --------- Signed-off-by: Aylei <rayingecho@gmail.com> --------- Signed-off-by: Aylei <rayingecho@gmail.com>

1 day ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • 2eacf88172 [Docs] Update doc for volume mount (#8299) * update doc for volume mount * Apply suggestions from code review Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

1 day ago

avadesian synced commits to refactor-k8s-timeout-calculation at avadesian/skypilot from mirror

3 days ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • ee7ac50a37 [Slurm] Separate stderr for get_job_nodes too (#8300)
  • 5af2bbc9d9 [Slurm] Separate out stderr in SlurmClient command runner (#8298) * [Slurm] Separate out stderr in SlurmClient command runner * reduce duplication * fix ut
  • 6bc8b5826b [Slurm] Better error messages for ssh config missing keys (#8297) * [Slurm] Better error messages for ssh config missing keys * docs * address pr review
  • d1c7d9034e [nightly] disable runpod tests (#8295)
  • a9d51ab37a [Slurm] Add SSH ProxyJump support (#8291) * [Slurm] Add SSH ProxyJump support * Update sky/provision/slurm/instance.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update sky/utils/command_runner.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * log warning if both ssh_proxy_command and ssh_proxy_jump are specified * rm debug log * remove ssh -vvv * unset _ssh_proxy_jump after overlaying _ssh_proxy_command --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
  • Compare 18 commits »

3 days ago

avadesian synced commits to fix-job-submission-sys-path at avadesian/skypilot from mirror

  • 8a2a4e0a86 Merge branch 'master' into fix-job-submission-sys-path
  • afb7768566 [release] fix the helm upgrade test to work with rc versions (#8264) [cd] fix the helm upgrade test to work with rc versions
  • dd21824060 Fix aiohttp version failure in buildkite (#8267) fix
  • e4e2d27653 [UX] Introduce pending state (#8262) * introduce pending state * add info comment * Update sky/utils/status_lib.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
  • 166fd63617 [Dashboard] Add some tests for the dashboard performance (#8227) npm install before verify
  • Compare 22 commits »

3 days ago

avadesian synced commits to riedgar-ms/build-314 at avadesian/guidance from mirror

  • 6c73b371e7 Merge branch 'main' into riedgar-ms/build-314
  • ddf9eed4f9 [Feature] Monitor-guided Inference (#1391) Ability to do inference-time semantic verification on a sequence of tokens ("thoughts", "steps") rather than at an individual token level constrained-decoding.
  • Compare 2 commits »

3 days ago

avadesian synced commits to main at avadesian/guidance from mirror

  • ddf9eed4f9 [Feature] Monitor-guided Inference (#1391) Ability to do inference-time semantic verification on a sequence of tokens ("thoughts", "steps") rather than at an individual token level constrained-decoding.

3 days ago

avadesian synced commits to slurm-ssh at avadesian/skypilot from mirror

5 days ago

avadesian synced commits to releases/0.11.0 at avadesian/skypilot from mirror

  • b2f3519723 Release 0.11.0 (#8255) Co-authored-by: GitHub Action <action@github.com>
  • 59ff48fce7 Release 0.11.0rc2 (#8236) Co-authored-by: GitHub Action <action@github.com>
  • 078dad7ada [core] restore cluster_name_on_cloud from cluster yaml (#8233) * [core] restore cluster_name_on_cloud from cluster yaml Fixes #8232. * add smoke test
  • 0207108092 Fixed incorrect user info in handlers (#8199 #8209) (#8234) * Fixed incorrect user in /enabled_clouds API (#8199) * Fixed incorrect user in /enabled_clouds API Signed-off-by: Aylei <rayingecho@gmail.com> * Comment-s * UT-s --------- Signed-off-by: Aylei <rayingecho@gmail.com> * Fixed incorrect user info in handlers (#8209) * Fixed incorrect user info in handlers Signed-off-by: Aylei <rayingecho@gmail.com> * Update sky/utils/common_utils.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Signed-off-by: Aylei <rayingecho@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Signed-off-by: Aylei <rayingecho@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
  • 4546cbc962 [Core] Put Daemonize Call Back to Make Sky Cancel Reliable (#8203) (#8208) * Add back daemonize call. * Format. * Add new cancellation test. Co-authored-by: lloyd-brown <lloyd@assemblesys.com>
  • Compare 44 commits »

6 days ago

avadesian synced commits to master at avadesian/skypilot from mirror

  • dd21824060 Fix aiohttp version failure in buildkite (#8267) fix
  • e4e2d27653 [UX] Introduce pending state (#8262) * introduce pending state * add info comment * Update sky/utils/status_lib.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
  • 166fd63617 [Dashboard] Add some tests for the dashboard performance (#8227) npm install before verify
  • 791356230a [tests] misc test fixes (#8260) misc test fixes
  • b5ff2fdb54 [deps] pin pycares<5 to work around aiodns issue (#8259)
  • Compare 6 commits »

6 days ago

avadesian synced commits to consolidation-docs at avadesian/skypilot from mirror

6 days ago

Baidu
map