vsoch open issue compspec/compat-lib#5.
Generate perfetto output
We would want to visualize library access over time with perfetto….View Comment
vsoch open issue converged-computing/lammps-time#3.
Weekend (early) experiment plans
I’m interested to see how this looks across a more reasonably sized run, which doesn’t need to be “HPC level” but I want to last a few minutes. I want to see the degree to which patterns vary across a set of nodes (all running LAMMPS, for example) and I’m also interested in the following ideas / use cases: …View Comment
vsoch open issue converged-computing/performance-study#73.
OSU Issue
- [ ] Azure AKS missing 128 32 and 256 in plots - we have the data so must be the global function to filter is filtering it out. …View Comment
vsoch open issue converged-computing/performance-study#72.
Stream Work to Do
- [ ] Each analysis needs to be redone with the data grouped by how it was run. …View Comment
vsoch closed issue flux-framework/flux-operator#236.
Add topo field
We should be able to specify a custom topology, e.g.,: …View Comment
vsoch open issue etetoolkit/ete#762.
status of library?
Hi there! :wave: This looks like a beautiful library, and one I’d like to use (we have a tree based scheduler for HPC that would be well afforded by many of these visualizations) but I cloned the repository and was trying some examples out, and there are still print statements from Python 2. I had wanted to generate the bar chart alongside the tree example and was trying to start with cluster examples, but they look like they haven’t been updated in many years. …View Comment
vsoch open issue converged-computing/fluxgen#3.
Create mode to just install flux
We might not need the config (e.g., if we are running under slurm) so it should not be required….View Comment
vsoch open issue compose-spec/compose-go#707.
Example to read config file from string?
This is likely a more rare use case, but I want to use go embed to include a string-ified compose file (that loads into a Project) and I’m wondering if you have an example to do that? I’m looking here: https://github.com/compose-spec/compose-go/blob/b9d3b1deec9bc25de0fa81867ca4df75aef5c791/cli/options.go and I see that ReadConfigFiles can accept a path or stdin, but not a loaded string. I can try to dig deeper and see if I can come up with something, or fall back to reading from file for now, but I’m hoping someone has come up with something. Thanks!…View Comment
vsoch open issue flux-framework/flux-k8s#86.
Abstraction of "socket" missing
Fluence adds socket resources (e.g., core) but doesn’t actually represent a socket in the graph. If we want to have this comparable to HPC setups, we likely want to add this in….View Comment
vsoch open issue moby/moby#48786.
Feature request question: development (build without failure) mode
### Description…View Comment
vsoch open issue converged-computing/kubernetes-jobs#1.
Feature requests for Kubernetes
This is an interesting thread on reddit (of a similar nature to what I did here) - a source of information about what people want. …View Comment
vsoch open issue converged-computing/ensemble-python#10.
idea: ensemble members separate
Right now, we are submitting isolated ensembles, meaning single yaml “ensemble.yaml” files. …View Comment
vsoch open issue flux-framework/flux-operator#235.
External API and client
If we want to present this as a full cluster, it would be interesting to try creating a grpc external API service (written in GO) that would allow external submission, and depending on the audience we are targeting, does not require users to rewrite their jobs. I need to think more about that last piece….View Comment
vsoch open issue vsoch/gridtest#42.
Creat experiment generation module
I’d like to be able to use gridtest to generate a matrix of parameters. E.g., I want to run many applications across problem sizes for a cloud, so I’d like to define that in a file and then populate templates with jinja. It’s a fairly simple thing, and should be easy to do here….View Comment
vsoch open issue rootless-containers/usernetes#338.
Running with different mode for `rp_filter`?
It looks like rp_filter is required to be 2: …View Comment
vsoch open issue acrlabs/simkube#132.
Feedback on Installation / Getting Started
Here is some early feedback on your documentation. First - I’d recommend having it rendered via readthedocs (or sphinx locally) and remove it from your branded website. The format of this site and navigation is hard to read, and it’s distracting to be alongside your profile, about, and basically many prompts to ask for funding. The project should stand alone on its own site. For example, for our flux projects we use a GitHub pages rendered mkdocs template (e.g.,). …View Comment
vsoch closed issue converged-computing/performance-study#28.
size 32 config missing from amg2023 experiments
There should be a size 32 here: https://github.com/converged-computing/performance-study/tree/main/experiments/azure/cyclecloud/gpu/experiment/configs/amg2023 …View Comment
vsoch closed issue LLNL/AMG2023#12.
Building with Hypre + Cuda
Hi! I’m having some trouble building AMG2023 with hypre and cuda, and trying my best to follow the instructions. The spack build approach doesn’t work because of an issue of detecting the cuda arch / variant of the system (it comes up as compute_none
so I’m trying the approach to build in a container. What I currently have:
…View Comment
vsoch open issue eksctl-io/eksctl#7949.
[Bug] placement group should be optional for capacity reservations
When we create a cluster with a capacity reservation (e.g., with GPUs) it’s often the case that EFA comes with the set. However, setting a placement group can actually limit the instances you get in the reservation. We had a reservation for 16, but because eksctl sneakily added a default placement group, we only were able to get 11. I was unable to set the groupName to null and had to comment out this block https://github.com/eksctl-io/eksctl/blob/fc24e94034492faeda18c43e17fade077c2ff090/pkg/cfn/builder/managed_launch_template.go#L80-L87 with a custom build of eksctl to create out cluster with a full 16. My suggestion would be to either add an explicit flag that says “don’t make a placement group with eFa” or allow the user to create a cluster with efa, but give red warnings about needing the group. …View Comment
vsoch open issue flux-framework/Tutorials#44.
Add tutorial survey link to main survey
To make it easy to find / click, we should have the survey linked directly at the end of the tutorial. This is a form that I can add right before we do the survey. It is also in the slides….View Comment
vsoch open issue Mellanox/k8s-rdma-shared-dev-plugin#114.
naive question about devices
Hi! I’m new to setting up infiniband, and am working on getting a setup in Azure. I have it working on a set of nodes in AKS, and I’m working on binding to the pod. My naive question is about the output (on the node) where I see two devices, mlx5_0 and mlx5_1. I notice that one is infiniband and one is ethernet: …View Comment
vsoch closed issue converged-computing/fluxnetes#4.
Discussion: do a reservation instead
I think instead of MatchAllocate
we should do MatchAllocateElseReserve
and then we don’t need to run the job again - the work will be scheduled for a future point. I haven’t ever used this function, but I’m assuming since a single schedule cycle updates these reservations, we likely need to schedule this to run again at a time in the future (the time of the reservation) and run MatchAllocateElseReserve
again (and keep going until it’s allocated) or use a separate queue.
…View Comment
vsoch closed issue converged-computing/container-chonks#1.
Next things for docker container chonker experiments
- [x] look at number of layers over time …View Comment
vsoch open issue flux-framework/flux-k8s#80.
Segfault in PreFilter
The current CI tests are failing because: …View Comment
vsoch open issue flux-framework/flux-sched#1261.
reservation cleanup: single traverser idea
An idea (already mentioned to @milroy and @trws) is to have a single traversal that can handle cleanup of reservations. I can talk about this in context of the easy queue policy. …View Comment
vsoch open issue compspec/jobspec#20.
plugins or command snippets
## support for custom (commonly used) snippets …View Comment
vsoch open issue rajibhossen/flux-k8s-scaler#1.
Repository Organizational Suggestions
- Main README: under the title explain the repository includes code and data for experiments published in X. Have an explicit link to each directory with a description of what is found inside. The current content is great - I’d move into an Experimental Summary section there. …View Comment
vsoch open issue converged-computing/fluxnetes#4.
Discussion: do a reservation instead
I think instead of MatchAllocate
we should do MatchAllocateElseReserve
and then we don’t need to run the job again - the work will be scheduled for a future point. I haven’t ever used this function, but I’m assuming since a single schedule cycle updates these reservations, we likely need to schedule this to run again at a time in the future (the time of the reservation) and run MatchAllocateElseReserve
again (and keep going until it’s allocated) or use a separate queue.
…View Comment
vsoch open issue riverqueue/river#474.
best strategy to update job args (or other metadata) to send to subscription?
Hi! Apologies for the many questions - I’m into a nice groove and am loving using River, and I’ve been searching code and discussions first before asking a question, to the best that I can. …View Comment
vsoch open issue riverqueue/river#471.
workers not running
Hi! I have a setup that I’m trying to get working here and I’m trying to do a basic job Insert, and then verify that the worker is running. I’ve tried everything I can think of to do, and although I see that the insert returns a jobid (and reports adding to the queue): …View Comment
vsoch open issue flux-framework/flux-docs#271.
Tutorial container and instruction
We got good feedback today that it would be nice to have our Flux Tutorial more easily available on demand, and I agree. For this, after we finish for RADIUSS I’ll build a final variant of the “spawn” image and add some page here that more officially shows how to run it….View Comment
vsoch closed issue flux-framework/spack#202.
[package-update] for flux-sched: update-package/flux-sched-2024-07-10
This is a request to open a pull request for a package update….View Comment
vsoch open issue skypilot-org/skypilot#3751.
[clouds] Support Flux Framework as a backend
Hi! I’m looking at this guide and trying to figure out where Flux fits. It could traditionally be thought of as an HPC scheduler, but we also have deployments for AWS and Google Cloud that go to raw VMs (e.g., with Terraform for example). I was looking at this issue https://github.com/skypilot-org/skypilot/issues/3072 and it seems to suggest a good starting point - running under Kubernetes. We already have the Flux Operator (I am lead developer on and cloud do development here) that deploys an entire flux cluster for Kubernetes. …View Comment
vsoch open issue rootless-containers/usernetes#335.
Usernetes with bypass4netns on multi-node
Hi @AkihiroSuda ! :wave: …View Comment
vsoch closed issue flux-framework/spack#189.
[package-update] for flux-core: update-package/flux-core-2024-06-05
This is a request to open a pull request for a package update….View Comment
vsoch open issue kubernetes/kubernetes#125852.
/dev/shm can be oversubscribed compared to host
Hi! I noticed that when we create the emptyDir and specify Memory as the storage medium and allow to calculate the sizeLimit “to match the host” - it does so based on the pod resource spec, here. I was testing this on an AWS hpc7g instance, which has 64 GB in /dev/shm
(on the host) and the entire host has 128GB:
…View Comment
vsoch open issue parca-dev/parca#4769.
Practical questions for monitoring applications
Hi! I tested out parca today for the default tutorial on a small Kubernetes cluster, and it’s awesome! I was able to run a few pods with an HPC (MPI) application, lammps, and see an expected pattern: …View Comment
vsoch open issue spack/spack#44678.
linux-pam: c compiler cannot create executables
There was a change to linux-pam merged yesterday: https://github.com/spack/spack/commit/9151fc1653317059097ae51a238cbb7c05197240 …View Comment
vsoch open issue flux-framework/spack#194.
linux-pam: change broke builds
I opened an issue here: https://github.com/spack/spack/issues/44678. …View Comment
vsoch open issue converged-computing/container-chonks#1.
Next things for docker container chonker experiments
1. look at number of layers over time …View Comment
vsoch closed issue flux-framework/spack#193.
[package-update] for flux-sched: update-package/flux-sched-2024-06-07
This is a request to open a pull request for a package update….View Comment
vsoch closed issue flux-framework/spack#190.
libuuid is deprecated
See https://github.com/spack/spack/blob/391c4cf099e67e5eb3f3734d30946c890a4442f6/var/spack/repos/builtin/packages/libuuid/package.py#L9 …View Comment
vsoch open issue compspec/compspec-modules#1.
Review of structure
Example is here: …View Comment
vsoch open issue volcano-sh/volcano#3503.
Why called volcano?
Hi! I have a more fun question than anything else - why is the project called “volcano” and is there a story behind it? Apologies if I’m missing something really obvious. :laughing: …View Comment
vsoch open issue flux-framework/flux-k8s#71.
Post refactor changes needed
- [x] Update fluence to go 1.20 or 1.21: We are going to hit issues using fluence (go 1.19) with other integrations like rainbow (go 1.20) and on our systems (go 1.20), and after #69 should consider updating. …View Comment
vsoch open issue bpfman/bpfman#1143.
bytecodeError hard to debug
Hi! I’m having a really hard time getting started using your library. I ran the examples OK, but when I modify the code a little bit to be abount counting sys_execve, no matter what I do, I get a bytecodeerror …View Comment
vsoch open issue isala404/kube-ebpf#1.
Updated version?
Hi! I’m trying to test this out (just locally with kind to start) and there is an issue with the init-headers: …View Comment
vsoch open issue kuenzelab/ENDURE#2.
Dockerfile
Hi! Just wanted to check that this is no longer valid: …View Comment
vsoch open issue converged-computing/rainbow-experiments#3.
Next spack experiments
- [ ] Fix bug that clusters are added to config (with empty token) …View Comment
vsoch open issue LLNL/AMG2023#12.
Building with Hypre + Cuda
Hi! I’m having some trouble building AMG2023 with hypre and cuda, and trying my best to follow the instructions. The spack build approach doesn’t work because of an issue of detecting the cuda arch / variant of the system (it comes up as compute_none
so I’m trying the approach to build in a container. What I currently have:
…View Comment
vsoch open issue snakemake/snakemake-storage-plugin-gcs#37.
rename to -plugin-gs?
@johanneskoester there is a bit of confusion about if the plugin is gs or gcs - that of course advises the prefix. If it is gs should we rename to that?…View Comment
vsoch open issue memgraph/memgraph#1975.
Question: suggestion for storing state?
Hi! I’m new to using Memgraph, and am going to test it for a graph database backend for our resource manager and scheduler. I’m wondering how memgraph (if it can) represent state? For example, the basic graph is resources (nodes) and relationships between them (a node has some number of cores) and then the state would be whether some subset of those cores are allocated (being used) or not. So typically when we have a scheduling problem we consider these states to determine if a request for work can be matched (to free resources). Any suggestions / pointers would be appreciated, thank you!…View Comment
vsoch open issue Textualize/textual#4442.
Using Textualize for a tutorial?
Hi! I’m wondering if anyone has ever put together components / widgets in a way to run a tutorial. As an example, if we have a job submission tool, we might have something that looks like chapters with reading and code snippets: …View Comment
vsoch open issue LINC-BIT/k8sSimulator#3.
Missing Code?
Hi! I found your paper https://www.mdpi.com/2072-666X/14/3/651 and was curious if you shared any of your code? I don’t see GitHub references in the paper and the content here is fairly sparse. The README suggests that a library exists (but possibly you don’t want to make it public)?…View Comment
vsoch open issue kubernetes-sigs/scheduler-plugins#722.
Rejection due to timeout / unreserve
Hi! I want to make sure I’m not doing anything wrong. I bring up a new cluster on GKE: …View Comment
vsoch open issue flux-framework/flux-k8s#73.
Test using Active queue "Activate Siblings" vs Current approach
Coscheduling uses a strategy of moving siblings to the active Q when a pod that is about to hit a node hits the Permit endpoint. The strategy I have in place to schedule the first pod seems to be working OK, but I’d like to (after we merge the current PR) test this approach. I can see pros and cons to both ways - having to rely on another queue (subject to other issues) seems less ideal than having them all scheduled at the right time. On the other hand, if something might happen with the latter approach that warrants the active queue, maybe it makes sense. I think empirical testing can help us determine which strategy we like best (or even a combination of the two)….View Comment
vsoch open issue flux-framework/flux-k8s#71.
Update fluence to go 1.20 or 1.21
We are going to hit issues using fluence (go 1.19) with other integrations like rainbow (go 1.20) and on our systems (go 1.20), and after #69 should consider updating. …View Comment
vsoch open issue converged-computing/rainbow#25.
The gnomes have things to say
I hate to disappoint, there are no cookies here. :cookie: :cookie: …View Comment
vsoch open issue converged-computing/rainbow#24.
Add robust logging library to control verbosity
As my experiments are getting very large, I am commenting out debugging messages so the terminal doesn’t explode. It would be better to use a logging library proper….View Comment
vsoch open issue converged-computing/ensemble-operator#11.
Add release workflow to generate manifests and builds
vsoch open issue spack/spack#43331.
cache errors recently?
Hi! We are getting 1-2 errors nightly (seems to be picking up) for our automation, and it’s usually some kind of checksum or connectivity thing (resolved by a rebuild) …View Comment
vsoch open issue converged-computing/rainbow#20.
Additional endpoints for graph subsystems
We should be able to: …View Comment
vsoch closed issue spack/spack-stack-catalog#8.
Large search leading to timeouts
We likely need to tweak the update strategy (or API call method) to better address this - we are hitting over 6 hours a few times a week now….View Comment
vsoch closed issue compspec/jobspec#6.
ui preference: stage vs copy
I think I want to more explicitly have: …View Comment
vsoch open issue compspec/jobspec#6.
ui preference: stage vs copy
I think I want to more explicitly have: …View Comment
vsoch open issue go-hep/hep#1010.
distributed use cases?
Hi! I’m looking for a go framework to run experiments on an HPC cluster, and I’m wondering if any of your simulations are amenable to that? Basically I can have a set of connected nodes and (if the library supports something like distributed) I’m hoping to use them. Thanks!…View Comment
vsoch open issue compspec/compspec#22.
Compspec generic generator
Compspec python (or maybe a plugin, undecided) should be able to take a generic request to generate a cluster node graph for a subsystem based on nodes. For example, starting with these nodes instead of manually creating the io subsystem (which I did) I should be able to say “Take my yaml definition file that defines these basic node types, and generate the equivalent JGF and (with my cluster nodes in context) attach them to all nodes). …View Comment
vsoch open issue compspec/compspec-ior#2.
Extraction of node I/O metadata
Not sure the extent to which ior will be used, but I want to put notes here for how we can eventually automate generation of the subsystem metadata for nodes. …View Comment
vsoch open issue compspec/compspec-go#32.
name exported in cluster nodes
We currently have a “name” at the top, which largely gets parsed over if we just look at the graph attribute. E.g.,: …View Comment
vsoch closed issue flux-framework/spack#162.
[package-update] for flux-core: update-package/flux-core-2024-03-05
This is a request to open a pull request for a package update….View Comment
vsoch open issue compspec/compspec#21.
create compspec-spack
I’m thinking ahead - and I think it would be useful to have a compspec-spack extractor. I think I already have most of the code for this, actually, in our prototype spack experiment, and I just need to make the repository here that properly wraps it. The use cases for this would be post build to describe software in a spack install. We would also want to think about the use case for pre-build - likely this would just be information about the environment needed….View Comment
vsoch open issue flux-framework/flux-operator#218.
Update openapi-gen
We are currently pinned to handle some bugs, note that we want to update to latest after fixed. …View Comment
vsoch open issue containers/nri-plugins#274.
Using plugins out of tree / not with Kubernetes?
Hi! I’m particularly interested in these two for functionality: …View Comment
vsoch open issue converged-computing/rainbow#10.
Graph Databases
I keep finding new / different ones, so want to make a list somewhere: …View Comment
vsoch open issue compspec/compspec-go#28.
Investigate nri-plugins for topology
https://github.com/containers/nri-plugins/blob/main/pkg/topology/topology.go…View Comment
vsoch open issue c0mm4nd/go-hwloc#1.
topology can't be allocated in Go (Is this being maintained)?
I just tried adding to my library, and: …View Comment
vsoch open issue flux-framework/fluxion-go#7.
Conversion of JGF v2 to JGF v1
Fluxion currently requires JGF version 1. The version 2 is a better design that allows for indexing the nodes based on name (as opposed to a list) and there are subtle differences in the design. I would prefer to continue developing tools (e.g., compspec-go) anticipating v2, so I think an OK strategy for the time being will be to allow fluence to accept a v2 and convert to v1 to give to flux. I’ll be able to implement this shortly when I’m ready (I’m currently adding the export of the cluster graph from compspec-go, which I’m going to use version 2 for). This will mean that: …View Comment
vsoch open issue compspec/compspec#18.
Dinosaur TODO
## Software …View Comment
vsoch open issue flux-framework/flux-k8s#68.
Design Problems for Fluence
I think I’ve been working on this over 30 hours this weekend and want to write down some concerns I have about #61, which is still not fully working with the new “bulk submit” model. …View Comment
vsoch closed issue flux-framework/fluxion-go#2.
test suite / ci planning
We need to, for the test suite: …View Comment
vsoch open issue draperlaboratory/fracture#61.
Repository still active?
Hi! Is this project still active? The autoconf required is really old. Thanks!…View Comment
vsoch open issue ahmedbougacha/dagger#25.
Suggested OS / version / other versions of dependencies?
Hi! I’m trying to build this in an ubuntu jammy docker container, and using all the default libraries from api. I hit about 90% and get: …View Comment
vsoch open issue aengelke/rellume#11.
More advanced examples?
Hi! Do you have examples that go beyond a single function as byte code? I’m wondering where to start with an entire binary. Thanks!…View Comment
vsoch open issue spack/spack#42615.
spack install lammps with mpich fails - libfabric issue
This looks to be an issue with the package, it’s expecting a lib64 path that does not exist. …View Comment
vsoch closed issue rootless-containers/usernetes#321.
Testing on ARM
I’m building a node for arm, and I noticed the Dockerfile is hard coded to a sha, which I would suspect is for AMD64. I am wondering if it would make sense to remove the sha pin and allow for the platform to be selected instead? What I’m doing now is trying out an arm sha and can report back. It does look like the rest of the Dockerfile is going to account for arm (e.g., CNI plugins). I suspect there might be more issues though….View Comment
vsoch open issue flux-framework/flux-sched#1142.
Question about binding include (header) files
This is probably a set of dumb questions, but: …View Comment
vsoch open issue converged-computing/half-baked#1.
Potential Mini Apps (or similar)
- [ ] Dagger (and examples) …View Comment
vsoch open issue converged-computing/flux-usernetes#2.
custom ARM digest not needed
If we rebuild the AMI we don’t need the custom Dockerfile, see https://github.com/rootless-containers/usernetes/issues/321…View Comment
vsoch open issue krator-rs/krator#76.
Design document?
Hiya! I was interested to learn about state machines (and in the context of rust) and found your project. I usually read code to understand how things work, but I’m not great with rust and was wondering if you had any design diagrams or similar somewhere, and that could help me read the code. Thanks!…View Comment
vsoch open issue supercontainers/compspec-go#17.
Schema graphs should be cached too
vsoch open issue rse-ops/lammps-matrix#3.
Kernel version
We likely want to add this for the next round of experiments, it didn’t seem to be an issue this time because all the runs worked without it….View Comment
vsoch open issue supercontainers/compspec-go#13.
LAMMPS Experiment needs
- [ ] Ability to ask to shuffle chosen images …View Comment
vsoch open issue GoogleCloudPlatform/hpc-toolkit#2202.
Example of startup script with cluster without vm-instance?
Hi! Most of the examples with startup-script have an associated vm-instance, e.g., …View Comment
vsoch open issue supercontainers/compspec#7.
Validation for nodes and edges
We will eventually want to validate that every node is connected to the graph. I have some checks in my prototype tooling, but likely this will bubble up to a higher level maintained tool. We don’t want any single node graphs! Tree… nubs. :laughing: …View Comment
vsoch open issue converged-computing/jsongraph-go#6.
feat: add support for merge function
it’s fairly common to want to merge two graphs. I’m writing the function now in another library, and can consider adding it here….View Comment
vsoch open issue flux-framework/flux-core#5709.
discussion: go bindings for flux-core
As we are planning on integrating flux better with Kubernetes, it would be nice to have flux-core expose Go bindings. As an example, the approach I’m taking now to expose a flux instance via a service -> ingress -> API that can be interacted with externally is via a sidecar in the main broker pod that is exposed via a port. This means having one physical node with one pod, and within the pod, two containers - one running an interactive flux cluster (it starts up and basically sleeps forever until we are done) and the second running the restful service (they share a common volume that has the flux socket so the second container can connect to it, see the queue, submit jobs, etc). …View Comment
vsoch open issue flux-framework/flux-k8s#60.
When update go version: test pending
Newer versions of the go API have support for a PENDING state that would skip the backoff queue, and I think this might be helpful / useful. I did (stupidly) try it this weekend (updating library versions) and it led to like, 5 hours of debugging mysterious panics, so yeah, not something we should do soon/first with all the current debugging we still need to do! But I wanted to put a note because I saw it was added recently….View Comment
vsoch open issue GoogleCloudPlatform/hpc-tools#4.
Missing support for ARM
Hi! I’m trying to build this into an ARM base, and there is an exec format error (I suspect no support)? …View Comment
vsoch open issue nushell/integrations#2.
Debian Packaging
I reached out to @yarikoptic about proper Debian packaging (I’ve never done it!) and he has shared his wisdom! Here is his proposal for going about this: …View Comment
vsoch open issue flux-framework/flux-sched#1134.
Fluxion go bindings: handles needed for job info and resource graph summary
We need to expose additional functionality with our Go bindings to make it easier to work on fluence and debug. Specifically I’d like to be able to: …View Comment
vsoch open issue kubernetes/sample-cli-plugin#6.
How to add TLS?
Hi! Is there an example for how to pass on the certificate-authority-data and do a grpc.Dial from here that is secure? The withInsecure works for my local development cluster, but for a cloud cluster, even when I expose the grpc service port via ingress it times out. I assume it’s related to TLS and that I’m explicitly not setting it. E.g.,: …View Comment
vsoch open issue flux-framework/flux-operator#214.
feature: suspendWorker boolean or retry count
We should have either a boolean or count for the user to decide how many times to retry worker pods. The use case is adding a new minSize that allows the minicluster to start with fewer workers. If the other workers eventually come up (and after the job is done) the pods might persist (and not complete) as they are retrying. We would want to avoid that….View Comment
vsoch open issue spack/spack#42069.
all fetchers failed for spack-stage-patch
Does this mean flakiness in some URL for a patch or something else? …View Comment
vsoch open issue snakemake/snakemake-plugin-catalog#2.
Add flux
hey @johanneskoester what do we need to do to add flux here? …View Comment
vsoch closed issue kubeflow/pipelines#9703.
[feature] Continued support for resourceOp / custom resource definitions
### Feature Area …View Comment
vsoch open issue flux-framework/flux-core#5674.
Expected warnings with Python 3.12
There might be more, but I’m building flux-core in a container with Python 3.12 and am running into some warnings! Here they are (and fixes): …View Comment
vsoch open issue spack/spack#42001.
boost checksum error
For the build here: https://github.com/flux-framework/spack/actions/runs/7455858988/job/20286295396 …View Comment
vsoch open issue flux-framework/flux-operator#212.
Notes on metrics
Note that if we ever need the flux operator to export metrics, kubebuilder has a default setup: https://book.kubebuilder.io/reference/metrics …View Comment
vsoch open issue converged-computing/kubescaler#18.
Experiment with GKE scaler to add spot instance groups
I recently adopted the linear programming algorithm to calculate optimized instance groups for Google Cloud, and (superficially) it looks like we might have some promising combinations, meaning groups of spot instances that beat on demand prices. https://github.com/converged-computing/metrics-operator-experiments/tree/main/google/spot-instances/run0. To support this, we will need to do similar to what I added to AWS to create the cluster and then add on / delete managed node groups. GKE, unlike AWS, does not allow for an empty control plane, so I might need to create a cluster with one tiny node first, not sure because I haven’t looked at this library in a while (but will soon)….View Comment
vsoch closed issue flux-framework/spack#147.
[package-update] for flux-core: update-package/flux-core-2024-01-03
This is a request to open a pull request for a package update….View Comment
vsoch open issue converged-computing/cloud-select#37.
Test out Google Pricing API
This might be harder to parse, but worth a shot. I didn’t see this before (or maybe I did and didn’t know how to make sense of it)! …View Comment
vsoch open issue flux-framework/flux-core#5644.
Consistent support for `--quiet` for submit and batch
heyo! I was wondering if we might have consistent support for --quiet
for each of flux submit
and flux batch
, which both can spit out ids. flux run
wouldn’t make sense I don’t think (but maybe there is a use case I’m not thing of). Here is some quick testing with the latest jammy image:
…View Comment
vsoch open issue flux-framework/flux-sched#1128.
Fluxion go bindings: better error message passing
When our current PRs are merged, I’d like to do a PR to make changes similar to here. Basically, when we are developing with the flux go bindings, an error message might look like the following: …View Comment
vsoch open issue flux-framework/flux-sched#1124.
Interface review for resource API
This issue is intended to review the exposed interfaces for the resource API, with the intention to understand what needs to be exposed (and what does not). I’ll tackle this from the standpoint of our primary use case (at least for now) - the fluence plugin that uses the Go bindings and other TBA out of tree plugins that use fluxion. …View Comment
vsoch open issue jsongraph/json-graph-specification#57.
Go bindings?
Hi! I was wondering if you had a go library to provide the basic structure that conforms to the schema. I’ve used this in a few projects now, and (if there isn’t one) would be happy to put it together. …View Comment
vsoch open issue archspec/archspec-json#82.
Please include instructions for install
Hi! Could you please add your intended instruction for install here? When we do just a go get, the submodule isn’t included. E.g., …View Comment
vsoch open issue rse-ops/docker-images#112.
Migrate to radiuss-docker
@davidbeckingsale when https://github.com/LLNL/radiuss-docker is complete could you please archive this repository (turn off actions, etc.) and direct folks over there? Thanks!…View Comment
vsoch open issue snakemake/snakemake-executor-plugin-googlebatch#18.
pi.calc output example is empty
@johanneskoester I am very close to getting this working! The step runs successfully: …View Comment
vsoch open issue spack/spack#41708.
should arm builds work?
I’m attempting my first arm builds with spack - not having luck: …View Comment
vsoch closed issue flux-framework/spack#136.
[package-update] for flux-sched: update-package/flux-sched-2023-11-18
This is a request to open a pull request for a package update….View Comment
vsoch open issue flux-framework/flux-k8s#48.
discussion: version release strategy
When I add a regular test-> deploy (after #47) we will be making regular releases. I want to propose the following release and versioning strategy. …View Comment
vsoch open issue rse-ops/hpc-apps#29.
Look at openradioss
https://www.openradioss.org/…View Comment
vsoch closed issue intel/PerfSpect#63.
Builder removed?
Hi! I’m looking to try this in Kubernetes (with the Daemonset) and noticed that your previous builder https://github.com/intel/PerfSpect/tree/ba15bcf7a28099c0cf38f6687f42c680dabbc69c/builder directory was removed. Was this intentional - is this a good starting base for what is referenced as the perrspect image here: …View Comment
vsoch open issue VClinic/VClinic#1.
Still maintained?
Hi! Is this tool still maintained? I noticed the build.sh doesn’t work and it requires Python 2 (which is deprecated for a bit). Thanks!…View Comment
vsoch open issue flux-framework/flux-sched#1113.
resource-query is missing test.yaml
test.yaml
is referenced for the example, but the test looks like it’s gone?
…View Comment
vsoch open issue flux-framework/flux-operator#211.
Flux Operator Refactor v0.2.0: Read if you want to use the Operator!
We have a WIP pull request that will make several improvements to the Flux Operator, and if you are using for the first time we recommend you use this version. The reason we aren’t merging is to await a paper (that details notes from the previous version). …View Comment
vsoch open issue snakemake/snakemake-executor-plugin-googlebatch#17.
local() file not uploaded to storage
@johanneskoester I was able to get my workflow to run by designating the pi_MPI.c as local, e.g., the input here: …View Comment
vsoch open issue prefix-dev/pixi#557.
Issue with indentation level of pypi installed package
### Checks…View Comment
vsoch closed issue vsoch/pull-request-action#99.
Build failure
Our ci is failing on docker build
task. Anyone else having the same problem?
…View Comment
vsoch open issue snakemake/snakemake#2539.
Snakemake error reporting passing job.output instead of job?
I’ve seen this error a few times - it happens on job error: …View Comment
vsoch open issue flux-framework/flux-k8s#44.
Fluence Refactor with kubernetes-sigs/scheduler-plugin
We want to refactor the build / deploy / testing of Fluence so that: …View Comment
vsoch open issue GoogleCloudPlatform/ramble#332.
Needs for Ramble for Kubernetes
@alecbcs and I chat today about needs for Ramble / Benchpark in the context of cloud (Kubernetes) and since we don’t have a CLA and cannot contribute (yet!) we want to include some of our notes here. …View Comment
vsoch open issue converged-computing/oras-operator#17.
allow operator to deploy without registry
If I provide credentials for an external registry, I don’t need one deployed on the cluster….View Comment
vsoch open issue converged-computing/flux-lima#2.
usernetes needs to increase uid/gid
See https://github.com/rootless-containers/usernetes/issues/311#issuecomment-1825026989. I ran into this issue when creating more pods….View Comment
vsoch open issue kubeflow/training-operator#1949.
Issue with mninst Dockerfile - python too old for pip install
This container base: …View Comment
vsoch closed issue rootless-containers/usernetes#310.
Exiting early?
Apologies if this is a bad question. I’m building VMs with usernetes and flux, and the entire VM does seem to build successfully in that nothing is missing and everything works. The issue I’m hitting is that it always exits early. E.g., I see: …View Comment
vsoch open issue oras-project/oras-go#644.
feature: allow for skip unpack on pull
## Problem …View Comment
vsoch open issue snakemake/snakemake-interface-executor-plugins#43.
Question: looking for means to run a final job step (that will be run locally)
I’m developing the kueue executor, and as a final step I either want to pull the final artifact from the registry, OR tell snakemake that we should not expect output locally (and then instruct the user to pull). I am not able to pull during the completion of the job step because as it is using asyncio I can’t call portforward: …View Comment
vsoch open issue snakemake/snakemake#2507.
Unique identifier for a workflow?
Is your feature request related to a problem? Please describe. …View Comment
vsoch open issue otomato-gh/s3-mounter#8.
s3 mounted in s3-mounter pod, but not in container
Hiya! …View Comment
vsoch open issue converged-computing/oras-operator#9.
Allow specifying multiple input sources
If we are dealing with a DAG, it could be that a step receives input from two previous steps, in which case we would want to be able to pull two artifacts to the same space. This would still assume that they can be pulled and extracted to the same root….View Comment
vsoch open issue snakemake/snakemake#2501.
Bug with storage interface no argument `--storage-http-allow-redirects`
I’m submitting to Google Batch using the latest branches (clones) from the interfaces (storage and executor) and snakemake, and I am hitting this missing argument that is generated by snakemake: …View Comment
vsoch closed issue hariharan-devarajan/dlio-profiler#50.
DLIO not able to see IOR events
I have DLIO running alongside IOR, with the following setup: …View Comment
vsoch open issue hariharan-devarajan/dlio-profiler#50.
DLIO not able to see IOR events
I have DLIO running alongside IOR, with the following setup: …View Comment
vsoch open issue kubernetes/kops#16066.
question: AWS elastic fiber adapter?
Hi! …View Comment
vsoch open issue snakemake/snakemake#2492.
snakemake --help broken?
I just pulled the latest from the main branch, and a basic --help
seems to be broken?
…View Comment
vsoch open issue snakemake/snakemake-interface-executor-plugins#40.
Bug: debug mode cannot be used with non-local execution
I think this is likely a bug? I don’t see why I can’t run self.logger.debug
:
…View Comment
vsoch closed issue flux-framework/spack#108.
[package-update] for flux-sched: update-package/flux-sched-2023-07-12
This is a request to open a pull request for a package update….View Comment
vsoch open issue snakemake/snakemake-interface-executor-plugins#36.
feature: support any argument as a step argument
Following up on #32, I think we should have a function on the executor base class that is able to allow for any value in self.executor_settings to also derive from a step resource. E.g., I just wrote this function: …View Comment
vsoch open issue snakemake/snakemake-wrappers#1875.
Wrappers for Google Batch
I’m working on the batch executor, and there are common snippets (that in retrospect were non-trivial to figure out) for setting up MPI and similar that (I think) would work best as a snakemake wrapper. Adding a note here for myself todo….View Comment
vsoch open issue spack/spack#40420.
flux-security virutal world munge not found
The bug: …View Comment
vsoch open issue snakemake/snakemake-executor-plugin-flux#6.
pypi credential
@johanneskoester what is your strategy for managing the pypi associated projects under the snakemake org? I would rather not generate a credential that is only linked to me, but we can definitely figure something out if that is your preference….View Comment
vsoch open issue converged-computing/metrics-operator#67.
volumes need target containers
Right now volumes are added to all pods in the set, and it needs to be selected….View Comment
vsoch closed issue singularityhub/singularity-hpc#659.
wrapper_base environment variable not expanded
After the addition of wrapper_base option in settings.yml file we noticed that when we use an environment variable in the wrapper_base path the variable is not expanded correctly. Note that we are running shpc install from inside a singularity container. …View Comment
vsoch closed issue singularityhub/sregistry#440.
NGINX Manifest unkown
Hey, …View Comment
vsoch open issue kubernetes-sigs/jobset#290.
Predictible hostnames with one replicated job
I’m testing a JobSet with just one replicated job, and one pod. The predictable hostname has gone away. E.g., what i’d want is: …View Comment
vsoch closed issue flux-framework/spack#110.
[package-update] for flux-core: update-package/flux-core-2023-08-03
This is a request to open a pull request for a package update….View Comment
vsoch closed issue flux-framework/flux-operator#200.
consider using pod affinity to ensure pods are 1:1 to nodes
See https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#more-practical-use-cases - the redis cache example that uses the hostname….View Comment
vsoch closed issue ovis-hpc/ovis#1261.
Containerization?
Hiya! I have a possibly dumb question. I’ve built this into a Docker container (and it seems to work) and I’m wondering what interfaces are used to derive the metrics/data? E.g., I’m wondering if I run this in Kubernetes in a sidecar pod, if the metrics would reflect the entire node the pod is on (or just the pod in question). Thanks!…View Comment
vsoch open issue converged-computing/flux-architecture#2.
Consider decision diagram
We talked about a “install X if Y” decision tree. I’m not sure there is a ton of complexity here, but I wanted to post some links that could be useful if we consider this. …View Comment
vsoch closed issue converged-computing/flux-architecture#1.
Visualization: project circle packing
It might be of interest to visually see where different plugins / modules live within projects. For this I propose a circle packing design, e.g., https://codepen.io/Lea12/pen/ExvoEgx where each top level circle is a project, and within we organize into subgroups. Ideally, each node of the packing should have a link to the thing of interest, and possibly we can have a different way to view the same descriptions. …View Comment
vsoch closed issue converged-computing/google-batch#1.
Need some concept of success policy
What seems to happen more often than not is that the second worker exits, and then the first has the connection close unexpectedly and then is waiting for it to come online. The job then runs indefinitely (I waited until the supposed cutoff time, 1 hour, and it’s still going.) We need to be able to have the worker sleep infinity, but be shut down for good when the first VM is done. I’m not sure how to accomplish that yet - it could be as simple as touching a final “done” final and then having an exit….View Comment
vsoch closed issue LLNL/maestrowf#421.
Invalid version number for flux!
It looks like the library here is linked to specific flux versions: …View Comment
vsoch open issue karljohanw/singularity-mode#1.
Contribute to singularity.lang?
hey @karljohanw ! We are discussing highlighting for Singularity recipes in emacs and stumbled on your work here! https://github.com/singularityhub/singularity.lang/issues/11. The LICENSE is not open to being able to use it, but would you be interested in contributing to our set of syntax highlighters at https://github.com/singularityhub/singularity.lang or minimally just a link in our README? Thank you!…View Comment
vsoch closed issue spack/spack-stack-catalog#7.
Needs local debug
The automated update started failing a few days ago with some issue with the query….View Comment
vsoch open issue flux-framework/flux-restful-api#56.
Bug with caching of credentials
Currently when someone logs in with incorrect credentials, the login is cached and we are locked out. This is a bug that needs investigation and fixing. Pinging @khoing0810 who has expressed interest in working on it!…View Comment
vsoch closed issue converged-computing/metrics-operator#40.
Consider metric app template
For the app-* metrics, I’m starting to see common patterns - there is some number of custom options, and then custom logic to derive entrypoints for a launcher and one or more workers. But the code files are getting very redundant! I’m wondering if there is some way (that would work with the limits of go interfaces) to have common JobSet patterns. In this case the launcher / worker would be a template that has the rest populated by a simpler struct….View Comment
vsoch open issue converged-computing/flux-go#6.
Extend Flux Go to include more examples
Ideally we would provide more examples here for interacting with Flux from Go, and then be able to build cooler apps using the functions! For this issue, I will walk through suggested steps to work on this. …View Comment
vsoch open issue snakemake/snakemake-interface-executor-plugins#13.
Desired for eventual template
We will eventually be putting this into a template for others to make plugins, and desired features include the following: …View Comment
vsoch open issue LLNL/AMG#5.
Thank you!
I just wanted to give huge props and say thank you for how easy this was to build, and find examples for running here! I literally typed make
in an environment with the dependencies (all installed easily with apt) and it worked, and then the example problems did too.
…View Comment
vsoch open issue flux-framework/flux-operator#200.
consider using pod affinity to ensure pods are 1:1 to nodes
See https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#more-practical-use-cases - the redis cache example that uses the hostname….View Comment
vsoch open issue BenWibking/hacc-coral#1.
x86 container?
Heyo! I’m trying to build this into a container - here is what I have: …View Comment
vsoch open issue axboe/fio#1610.
[possible bug] not working as expected?
Please acknowledge you have done the following before creating a ticket …View Comment
vsoch open issue kellyjonbrazil/jc#441.
pidstat bug with two output tables
There seems to be a bug with the pidstat parser that it expects one header for the table. As an example in the output below, adding -T ALL
is going to show us the chosen task plus global set (see man pages here) but what jc seems to be doing is using the top level header, and then shoving the second row of values into it.
…View Comment
vsoch open issue converged-computing/flux-architecture#1.
Visualization: project circle packing
It might be of interest to visually see where different plugins / modules live within projects. For this I propose a circle packing design, e.g., https://codepen.io/Lea12/pen/ExvoEgx where each top level circle is a project, and within we organize into subgroups. Ideally, each node of the packing should have a link to the thing of interest, and possibly we can have a different way to view the same descriptions. …View Comment
vsoch open issue kubernetes-sigs/jobset#237.
JobSet release for ARM?
I’m testing JobSet with an operator, and I saw this error in my operator …View Comment
vsoch open issue kubernetes-sigs/jobset#231.
pytorch example container is too chonky
I’m testing out the pytorch example: …View Comment
vsoch closed issue converged-computing/hyperqueue-operator#4.
Add arm build
We will need an arm build of the operator and lammps to run experiments….View Comment
vsoch open issue It4innovations/hyperqueue#609.
ARM build?
Heyo! Any chance we could add an ARM build to the set? …View Comment
vsoch open issue GoogleCloudPlatform/scientific-computing-examples#60.
allnode example nfs mount variables are different
hey @wardharold ! Just a heads up for this example: …View Comment
vsoch open issue cue-labs/oci#9.
Planning and design?
Heyo!
vsoch closed issue flux-framework/spack#106.
[package-update] for flux-core: update-package/flux-core-2023-07-08
This is a request to open a pull request for a package update….View Comment
vsoch open issue snakemake/snakemake#2351.
support for ORAS artifacts as a form of storage
ORAS (ocI registry as storage) is a means to put arbitrary data blobs in an OCI (aka Docker) registry. In the case of Singularity, this could also be a container to pull (this issue) however this use case is different! https://github.com/snakemake/snakemake/issues/1263. We ideally could allow for someone to specify an oras remote for data, e.g., instead of: …View Comment
vsoch open issue snakemake/snakemake#2349.
Custom attributes?
The documentation here: https://snakemake.readthedocs.io/en/stable/snakefiles/rules.html#resources suggests that we can add arbitrarily named resources: …View Comment
vsoch open issue converged-computing/flux-burst#8.
Workaround for flux KVS
We are currently tracking: https://github.com/flux-framework/flux-core/issues/5308 …View Comment
vsoch open issue flyteorg/flyte#3829.
[Plugin] Flux Operator
Hi! :wave: I develop the Flux operator https://flux-framework.org/flux-operator/ which conceptually is like the MPI operator, but it brings up a Flux Framework cluster (that acts as a job) to run a scoped piece of work, akin the MPI operator. I’m interested in adding it as a plugin (and can also do the development work for it) but I wanted to check first about the order of operations. …View Comment
vsoch open issue GoogleCloudPlatform/scientific-computing-examples#59.
Suggestion: don't require cloud build
I tried cloud build again today (I’m working on a PR that will allow a custom broker config, likely will put in soon!) and I ran into the same mysterious error as before. If this extra dependency is going to be something that regularly breaks, I want to suggest having an approach that uses packer without needing cloud build. The recipes I’m currently using are here: …View Comment
vsoch open issue flyteorg/flyte#3821.
[BUG] flytectl demo start progress bar is broken?
### Describe the bug…View Comment
vsoch open issue sphinx-doc/sphinx#11479.
bug with newer pydantic
### Describe the bug…View Comment
vsoch open issue hpc-social/jobs#9.
Looking to submit a job? :eyes:
Hey there! :wave: thanks for stopping by! If you are looking to post a job, we ask that you use our jobs form on the website: …View Comment
vsoch closed issue converged-computing/flux-metrics-api#3.
On the fly custom metrics or calculations
We would want to allow the user to define “on the fly” metrics or rules….View Comment
vsoch open issue bbockelm/htcondor-autoscale-manager#1.
Question about token + dedicated hosts deployment
Hey @bbockelm ! Matt West directed me to your repository (woot autoscaling!) and that you suggested taking a token approach to setting up my HTCondor cluster. The list hasn’t been super responsive so I’m hoping we might chat here (and if you have some wisdom to share it would be greatly appreciated!) …View Comment
vsoch open issue kubeflow/pipelines#9678.
cannot save parameter /tmp/outputs/condition
I’ve created a KubeFlow pipelines deployment on GKE with the manifests directly from the repository (the ones in the tutorial combined with code examples here led to validation errors). You can see the exact steps I’m taking to deploy the cluster, install KubeFlow, generate the compile / compile / install here: https://github.com/converged-computing/flux-operator-component/tree/add/component#kubeflow-on-gke …View Comment
vsoch closed issue singularityhub/singularityhub.github.io#244.
github webhook isnt working
### Links …View Comment
vsoch open issue kubernetes-sigs/kubebuilder#3454.
Documentation request: e2e test with webhook
Hi Kubebuilder developers! I am looking for an example of an e2e test that also includes adding the webhook. We have some logic that is set in the webhook, and would like it to be there for the e2e test. An example adding to the testEnv might look like this: …View Comment
vsoch open issue converged-computing/kubescaler#4.
Create better install logic for aws vs google
Right now they are both attempted to be imported, so having them distinct isn’t meaningful. We either need to: …View Comment
vsoch closed issue GoogleCloudPlatform/batch-samples#33.
Networking between parallel instances?
Hiya! We were wanting to use a tool that will assess networking between instances. Is this possible, or when we set parallelism it is running the same thing in parallel N times (but the nodes are not connected?) Thanks!…View Comment
vsoch closed issue terraform-aws-modules/terraform-aws-ec2-instance#335.
[Question] setup with networked EC2 instances
Hi! :wave: I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I use a common module to setup networking, and then I have some logic in the startup script to use the Google metadata API to get the ips for other instances. I’m looking for a similar setup for AWS - and this looks really close to what I’m interested in (but I have some questions!) …View Comment
vsoch closed issue flux-framework/spack#93.
Add flux-security
Three steps for changes: …View Comment
vsoch open issue GoogleCloudPlatform/ramble#164.
Question: support for workflows outside of spack?
Hiya! Is there support (or planned support) for being able to define an execute workflows with other kinds of package managers? E.g., one very easy family would be something like conda/mamba to install a bunch of python machine learning libraries, which aren’t handled as well under spack. Thanks!…View Comment
vsoch open issue GoogleCloudPlatform/batch-samples#33.
Networking between parallel instances?
Hiya! We were wanting to use a tool that will assess networking between instances. Is this possible, or when we set parallelism it is running the same thing in parallel N times (but the nodes are not connected?) Thanks!…View Comment
vsoch open issue GoogleCloudPlatform/ramble#159.
Hostname experiment is missing space!
Hiya! I’m trying out ramble for the first time today, and it looks like the example experiment for hostname is missing a space between “time” and “mpirun” e.g., see the last line: …View Comment
vsoch open issue kubernetes-sigs/kind#3265.
Possible Documentation Bug: ingress example
Hi! I’m following the ingress example here and when applying the Kong patch, I get: …View Comment
vsoch open issue odise/terraform-aws-ec2-instance#27.
Multiple instance example?
Hi! Do you have an example to create more than one instance with DNS or an ability for them to ping one another? Thanks!…View Comment
vsoch closed issue flux-framework/flux-core#5225.
[user-interface] flux broker vs. flux start and config file
Hiya! I wanted to bring up a point of discussion - why the flux broker
vs flux start
have slightly different interfaces for specifying the config file:
…View Comment
vsoch open issue rootless-containers/usernetes#281.
Question: support for multiple hosts?
Hi! I have a simple question I didn’t see obviously in the README or doing a quick search - does usernetes support multiple hosts or does it assume running on one host? I saw it is using ` slirp4netns.` which seems to be the same (and thus might result in the same problem) as I’m hitting with k3s https://github.com/k3s-io/k3s/discussions/7615#discussioncomment-6016006. I also see that k3s uses usernetes? So maybe it’s exactly the same problem! …View Comment
vsoch open issue k3s-io/k3s#7614.
k3s rootless - two step install
Hi! I’m trying to prepare a set of VMs with k3s, and have everything that I need to (later) allow a user to login and setup rootless. With the current suggested workflow, namely calling the install script in one of two ways: …View Comment
vsoch open issue GoogleCloudPlatform/scientific-computing-examples#55.
hpc-tools integration
I was playing around with Google Batch today, and in one of the examples it used an executable from here to install intel MPI: …View Comment
vsoch open issue GoogleCloudPlatform/batch-samples#29.
Issues with batch MPI examples
Hi! …View Comment
vsoch open issue flux-framework/PerfFlowAspect#112.
Trouble reproducing build
Hi! I’m trying to follow the logic in the GitHub workflow and I have this Dockerfile: …View Comment
vsoch open issue GoogleCloudPlatform/hpc-tools#2.
Install mpicc?
Hi! I’m trying to install intel MPI (runtime and compiler) but I can’t seem to find mpicc. I’m doing: …View Comment
vsoch open issue bids-apps/maintenance-tools#6.
Docker Hub Manifests are Unknown
Hi! Similar to other bids apps (e.g., https://github.com/BIDS-Apps/aa/issues/6) all of the digests on Docker Hub are unknown. …View Comment
vsoch open issue GoogleCloudPlatform/scientific-computing-examples#53.
Error if policy exists
In the case that a user runs a particular config twice, I’ve hit this bug (twice now) that it tells me the collocation policy (I think this is for compact) already exists: …View Comment
vsoch closed issue flux-framework/spack#97.
[package-update] for flux-core: update-package/flux-core-2023-05-16
This is a request to open a pull request for a package update….View Comment
vsoch open issue anvaka/map-of-github#6.
Label for HPCLand
Hi! I was wondering if it’s possible to suggest labels for different groups? Specifically here: …View Comment
vsoch open issue cloudposse/terraform-aws-ec2-autoscale-group#109.
Support for lifecycle hooks
## Describe the Feature …View Comment
vsoch open issue zoitech/terraform-aws-network#58.
Example of using this module with EC2 instances?
Hi! :wave: I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I use a common module to setup networking, and then I have some logic in the startup script to use the Google metadata API to get the ips for other instances. I’m looking for a similar setup for AWS. To be specific, my questions are: …View Comment
vsoch open issue terraform-aws-modules/terraform-aws-ec2-instance#335.
[Question] setup with networked EC2 instances
Hi! :wave: I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I use a common module to setup networking, and then I have some logic in the startup script to use the Google metadata API to get the ips for other instances. I’m looking for a similar setup for AWS - and this looks really close to what I’m interested in (but I have some questions!) …View Comment
vsoch open issue infrablocks/terraform-aws-base-networking#78.
Example adding instances from EC2?
Hi! :wave: I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I use a common module to setup networking, and then I have some logic in the startup script to use the Google metadata API to get the ips for other instances. I’m looking for a similar setup for AWS. To be specific, my questions are: …View Comment
vsoch open issue cn-terraform/terraform-aws-networking#22.
Example with custom instances?
Hiya! I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I’ll have some logic in the startup script to use the Google metadata API to get the ips for other instances. So I think my questions are two-fold: …View Comment
vsoch open issue aws-samples/aws-network-hub-for-terraform#29.
[Question] custom setup with group of instances?
Hi! :wave: I’m new to terraform, and I’m looking to create a set of EC2 instances that can see one another. I’ve done this on Google Cloud, and for that strategy I use a common module to setup networking, and then I have some logic in the startup script to use the Google metadata API to get the ips for other instances. I’m looking for a similar setup for AWS. To be specific, my questions are: …View Comment
vsoch closed issue kubernetes-sigs/jobset#131.
JobSet usage in another operator
Hiya! I’ve started experimenting with using JobSet from another operator (the Flux Operator) and finished most of the implementation (REF, but when I got up to testing it (e.g., I need to still check how the new headless service that is created for me works to update the broker config) but I ran into some issues. What I’m doing is: …View Comment
vsoch closed issue flux-framework/flux-operator#151.
Worker pod termination
We currently don’t take any steps to terminate worker pods when the broker stops, and ideally we would be able to flag all the worker pods as Completed when the broker pod is. If anyone has ideas please comment here. This issue was originally discussed (with a few ideas) here: https://github.com/kubernetes-sigs/kueue/issues/716…View Comment
vsoch closed issue tainagdcoleman/wfchef#1.
Example workflows?
Hiya! I just saw your paper https://www.sciencedirect.com/science/article/abs/pii/S0167739X23001711 (congrats!) and was wondering if this is the right repository for wfchef, and if there are any tutorials to try it out?…View Comment
vsoch open issue tainagdcoleman/wfchef#1.
Example workflows?
Hiya! I just saw your paper https://www.sciencedirect.com/science/article/abs/pii/S0167739X23001711 (congrats!) and was wondering if this is the right repository for wfchef, and if there are any tutorials to try it out?…View Comment
vsoch open issue bentoml/BentoML#3836.
BentoML Integration with the Flux Operator
hi! I’m a developer for the Flux Operator, which (from a high level) let’s you deploy an HPC (supercomputer) cluster on Kubernetes. It comes down to a set of networked pods (via an indexed job) and we’ve been able to run distributed setups on it from Tensorflow, Pytorch, etc. …View Comment
vsoch open issue kubernetes/kubernetes#117819.
Network readiness for indexed job + headless service depends on presence of pod
### What happened?…View Comment
vsoch open issue flux-framework/cheat-sheet#6.
Add support for dark and light modes
We would want to reproduce what we have under https://flux-framework.org. This means the steps to do this would be: …View Comment
vsoch open issue zeromq/czmq#2263.
Saving certificate to memory?
Hi! I’m using zeromq from within a Kubernetes operator to generate a configmap to write / provide to pods. I noticed here: https://github.com/zeromq/czmq/blob/0b3a4c5e5c308b3d81ecb73fdf0030c150af3ed6/src/zcert.c#L18-L19 there is a note about saving to memory, and I was wondering if there is an example / test that shows how to do that? Ideally we would not need to save these paths to the filesystem of the operator. Thank you!…View Comment
vsoch closed issue singularityhub/singularity-hpc#598.
Customise container naming
Is your feature request related to a problem? Please describe. …View Comment
vsoch closed issue singularityhub/singularity-hpc#644.
container_base with podman issue at perlmutter
we are using podman
on Perlmutter and currently podman is not setup to automatically download containers. @scanon had suggested i create a storage.conf
file in my $HOME/.config/containers/storage.conf where we specify graphroot
to location where containers are stored
…View Comment
vsoch open issue PrefectHQ/prefect#9315.
Support for Flux Framework via Flux Operator
### First check…View Comment
vsoch open issue flux-framework/kubeflux#1.
Repository ever used?
I was browsing Eduardo’s repos and noticed a link to this repository - it looks like maybe it was created and then never used (because the code wound up under an openshift + flux-k8s repo? …View Comment
vsoch open issue icl-utk-edu/hpcc#4.
Install in ubuntu container - lirc?
Hi! I’m trying to run make arch=linux
in a container - I randomly chose an ubuntu base but I’m not able to get beyond this:
…View Comment
vsoch open issue converged-computing/oras-csi#15.
feature: cleanup options
It might be the case that we want an artifact to be cleaned up after the fact, or I can see some kind of cleanup logic implemented on the level of the driver itself. Thus, we should brainstorm an ability to specify conditions for cleaning up an artifact when the kubernetes pod / object is deleted, both on the level of the pod and the driver itself….View Comment
vsoch open issue PrefectHQ/prefect#9181.
Workflow (flow?) catalog or similar?
### First check…View Comment
vsoch closed issue flux-framework/flux-pmix#83.
Spack Build Error for pmix TASKMAP_ENCODE_RAW_DERANGED undeclared
Hiya! Were there any recent changes that might cause this error? …View Comment
vsoch closed issue alpha-unito/streamflow#88.
Design question about using DAG for Kubernetes Operator
Hi again! We have Kubernetes Operator that brings up an entire Flux Framework “MiniCluster” - https://flux-framework.org/flux-operator/ and I’m just starting to develop an ability to save state between MiniClusters (basic example) so we could do something that run a subset of jobs for a workflow, save state, and then bring up a different resource. …View Comment
vsoch open issue GoogleCloudPlatform/gcs-fuse-csi-driver#8.
Driver Not Working
Hi again! I am trying to debug why my pods are stuck in pending. Here is a bunch of logs / configs that might shed some light. …View Comment
vsoch open issue flux-framework/flux-pmix#83.
Spack Build Error for pmix TASKMAP_ENCODE_RAW_DERANGED undeclared
Hiya! Were there any recent changes that might cause this error? …View Comment
vsoch closed issue flux-framework/spack#85.
[package-update] for flux-sched: update-package/flux-sched-2023-04-01
This is a request to open a pull request for a package update….View Comment
vsoch open issue charmbracelet/lipgloss#190.
Example to toggle full screen?
Hi! I have a keymap that I want to indicate full screen, and my code under the view Update() looks like: …View Comment
vsoch closed issue weaveworks/eksctl#6222.
[Bug] Issue with efa device plugin running as root
Hi! I opened the issue here https://github.com/aws-samples/aws-efa-eks/issues/8 so they can be tracked in sync. I just updated my version of eksctl and it pulled in the new changes, and we started seeing the issue I’ll report here. We are creating an EKS cluster with eksctl, specifically like this: …View Comment
vsoch open issue funcx-faas/funcX#1077.
FluxExecutor exposed as endpoint executor?
Hi! I was looking to set up a funcx endpoint, where the funcx-endpoint start X
command would be run by the flux broker, thus giving it access to an entire cluster in Kubernetes. But I was looking here: https://github.com/funcx-faas/funcX/blob/main/funcx_endpoint/funcx_endpoint/executors/init.py and I don’t see it as an option. Is this something we could potentially add?
…View Comment
vsoch open issue charmbracelet/wishlist#171.
Question about soft serve
Hi! I like the design of the navigation sidebar and a main viewport here: …View Comment
vsoch closed issue aws-samples/aws-efa-eks#8.
Issue with container running as root
Hiya! I found this repository because I’m creating an EKS cluster with eksctl, specifically like this: …View Comment
vsoch open issue LLNL/merlin#413.
Question: where does singularity openfoam image get pulled?
I’m looking at this tutorial https://github.com/LLNL/merlin/blob/7e07b72b969afc2252a892d9658a4854f97e4bb9/merlin/examples/workflows/openfoam_wf_singularity/openfoam_wf.yaml#L13 and I’m wondering if the user is responsible for pulling the container to the running directory (alongside scripts) and if so, are you using one of these images? https://hub.docker.com/u/openfoam…View Comment
vsoch closed issue MLMI2-CSSI/foundry#321.
globus: seems to be hard requirement
I was hoping to run some examples in an operator I’m working on, but it would be hard to create a Globus endpoint there. Is there any use case for foundry that does not require globus?…View Comment
vsoch open issue GoogleCloudPlatform/scientific-computing-examples#48.
Testing flux framework basic
Hey @wardharold ! I’ve taken a preliminary look at the setup and was able to do the builds, but on the terraform “basic” example when I tried to init my basic.vars: …View Comment
vsoch open issue MLMI2-CSSI/foundry#321.
question: is globus a hard requirement?
I was hoping to run some examples in an operator I’m working on, but it would be hard to create a Globus endpoint there. Is there any use case for foundry that does not require globus?…View Comment
vsoch closed issue flux-framework/flux-operator#80.
Support for helm chart
I think the mpi-operator has one - https://github.com/kubeflow/mpi-operator/tree/c21942d1e27d196d06078f67f39a6f7a0e990dee/hack/helm and this would be really useful to have here too, e.g., for cases when we want to further customize our own defaults….View Comment
vsoch open issue awslabs/mountpoint-s3#180.
Usage in Kubernetes?
Hi! Has anyone tried this in the context of Kubernetes? I could create a daemon set to install the needed deps for the nodes, and the main thing I think I’m not sure how to do is to ensure there are credentials there for doing the bind. Has anyone tried this / has experience and could make some suggestions? Thank you!…View Comment
vsoch closed issue flux-framework/flux-operator#27.
Proper Test Suite
When we are somewhat happy with the initial proof of concept design, let’s look into best practices for testing and ensure we write tests….View Comment
vsoch open issue nextflow-io/nf-wave#7.
Suggested archive mechanism - from registry using ORAS
Hiya! Instead of requiring an https address for a .tar.gz, I want to suggest looking at ORAS: https://oras.land/ …View Comment
vsoch open issue exalearn/colmena#97.
Potential issue between FluxExecutor and colmena
Hi there! We’ve been trying to run the example workflow from ExaWorks (modifying to use the FluxExecutor) and there seems to be a bug - reported in detail here: https://github.com/ExaWorks/molecular-design-parsl-demo/issues/2#issuecomment-1474295903 that it hangs. It would be nice to get this working to have full examples using the Flux Operator, and as suggested by @WardLT we might start with simple test cases (and use the FluxExecutor). Let me know how I can be of help….View Comment
vsoch closed issue exalearn/colmena#96.
make_queue_pairs removed?
Hi! I’m using a script from 8 months ago with: …View Comment
vsoch open issue CEED/Laghos#174.
Laghos in a workflow?
Hi! I was wondering if you have any example workflows that use Laghos that are beyond a single command (e.g., the mpirun examples at the bottom of the repository). We are looking for something that has a bit of complexity in terms of workflow steps. Thanks!…View Comment
vsoch closed issue jbms/sphinx-immaterial#235.
Search missing most content
Hiya! I’m using your theme at https://flux-framework.org/flux-operator (love it!) and I’ve noticed that the search isn’t showing most of the content (e.g., browse the site, find a term, search for it, no results?) …View Comment
vsoch open issue alpha-unito/streamflow#88.
Design question about Kubernetes (helm) executor
Hi again! We have Kubernetes Operator that brings up an entire Flux Framework “MiniCluster” - https://flux-framework.org/flux-operator/ and I’m just starting to develop an ability to save state between MiniClusters (basic example) so we could do something that run a subset of jobs for a workflow, save state, and then bring up a different resource. …View Comment
vsoch closed issue rook/rook#11481.
[question] integration with another operator
Hiya! I think I could open this on the slack, but since it’s the holiday I thought this might be better to catch someone in the new year. Basically, I’m interested in a storage solution for an operator I’m developing for Flux, meaning we can run HPC-type workloads in Kubernetes https://github.com/flux-framework/flux-operator. I’ve mostly been doing local development with MiniKube, and of course in this context a local host volume works fine, but today I tried for the first time to deploy to GKE, and the persistent volume claim (that needs to be ReadWriteMany) of course failed! I’m looking into robust solutions to deploy storage (ideally controlled by our operator to avoid complexity) and most of what I’ve seen talks about setting up NFS https://medium.com/@Sushil_Kumar/readwritemany-persistent-volumes-in-google-kubernetes-engine-a0b93e203180. I saw a suggestion for Rook here https://discuss.kubernetes.io/t/how-to-setup-a-readwritemany-and-readwriteonce-storageclass/12097 and while it looks like this is an operator in and of itself (which might not be ideal to require two operators) I thought it might be that someone else had come here with a similar need and I’d open up an issue for discussion! Is there some use case for rook, or an underlying volume type / service that rook uses, that we could use in the Flux Operator? Thanks for your wisdom - and apologies if this is super noob question I’m new to developing for Kubernetes (and really loving it so far!)…View Comment
vsoch open issue flux-framework/flux-operator#100.
Certificate generation container
There isn’t any compelling reason we can’t use a consistent container to generate the certificate….View Comment
vsoch closed issue converged-computing/playground#5.
Add testing action
A user that develops a tutorial should be able to run a testing action alongside it….View Comment
vsoch closed issue kubernetes/kube-openapi#376.
How to specify a default?
hi! I’m generating the Go output for an operator-sdk API (generated with kubebuilder), and everything seems to look okay but my defaults are not passed forward. As an example, here I have a default (defined with a kubebuilder tag) for a timeout in seconds: …View Comment
vsoch open issue flux-framework/flux-core#4957.
"Bubble up" of flux mini to top level commands
As discussed, our first low hanging fruit is to move the flux mini commands up to the top level group, ideally leaving the old functionality and putting a deprecation notice. Some of that vision is outlined here: https://github.com/vsoch/flux-commands and I’ll follow up with the start of a pull request. @alecbcs was involved in creating that document (and I think still interested in helping?) so I’ll ping him here too. …View Comment
vsoch open issue yandex-cloud/k8s-csi-s3#40.
Issue getting this setup!
Hi! I’ve been trying a few days to get this working, and aside from probably being an idiot I suspect I’m doing something wrong. My use case is that I have an existing bucket that I want to attach pods to (in an indexed job) to use the workflow flies. What I’ve tried: …View Comment
vsoch closed issue flux-framework/flux-operator#57.
Enchancement: better control of loggers
Currently, when we start with four (or more) workers, we see the equivalent in logging: …View Comment
vsoch closed issue flux-framework/flux-operator#50.
Consider making -E with PATH and PYTHONPATH default
We are currently running flux start as follows: …View Comment
vsoch open issue flux-framework/flux-operator#82.
Write summary document
I like how this is presented: https://github.com/kubeflow/mpi-operator/blob/master/proposals/scalable-robust-operator.md…View Comment
vsoch open issue flux-framework/flux-operator#81.
Timing Improvements
We can create custom events / conditions within the operator, and then easily save this with output (as yaml) to parse later in flux-cloud. We will eventually want a better high level solution for logging (e.g., getting from flux possibly) but this should give us more fine-grained detail in the interim. …View Comment
vsoch open issue flux-framework/flux-workflow-examples#91.
Meta issue for new examples
I’m going to be porting a lot of the subdirectories here to be sphinx gallery examples, and I’ll make a list (and check them off) as I go just so I can keep track! When that’s done I think this repository can be archived. Still setting things up locally - list coming soon!…View Comment
vsoch closed issue ofek/csi-gcs#154.
Question about "csi" directive in PersistentVolume
Hiya! I’m new to setting this up, so I apologize in advance for the silly question. I have my storage set up, the secret created, and now I’m adding the logic for my operator to create the same as the pv.yaml and pvc.yaml. But I’m looking at the section here: …View Comment
vsoch closed issue flux-framework/spack#72.
[package-update] for flux-core: update-package/flux-core-2023-02-03
This is a request to open a pull request for a package update….View Comment
vsoch open issue deinstapel/k8s-rook-ceph#1.
Updated for 2023?
Hiya! I am wanting to do storage in a shared namespace, and I’m hoping this recipe would still work with the current version of their yaml files in the rook ceph repository? I tried following their instructions to make storage and it was really unclear and am looking for other ways. Thank you!…View Comment
vsoch open issue flux-framework/flux-k8s#35.
Provenance for LAMMPS efa image
In case anyone needs it, this Dockerfile that builds to this container is a variant of spack v0.17, specifically commit 7893be7712ed709f6136ac83f49afc3d718d5ddc
….View Comment
vsoch closed issue flux-framework/spack#68.
[package-update] for flux-core: update-package/flux-core-2023-01-25
This is a request to open a pull request for a package update….View Comment
vsoch closed issue flux-framework/flux-operator#70.
Have automated deploy for the build config
Currently I run make build-config
to update the flux-operator.yaml (to install it to a cluster) in examples/dist. However, this is error prone - I could forgot! So either we should have a check in the CI that the user has run it (and fail if the two files are different) or have a different deploy strategy….View Comment
vsoch closed issue jbms/sphinx-immaterial#208.
Bugs with search when deployed to readthedocs
Hiya! Everything in the theme works great locally, and on GitHub pages, but on readthedocs I see: …View Comment
vsoch open issue flux-framework/flux-operator#66.
Add debugging for resources, limits and requests
The verbosity of adding these should be upped a bit, whenever I work on this next….View Comment
vsoch closed issue weaveworks/eksctl#6116.
[Feature] elastic fiber adapter (efa) from the command line
### What feature/behavior/change do you want? …View Comment
vsoch open issue crate-ci/typos#651.
Ignore regular expression
We have job identifiers that look like this: …View Comment
vsoch open issue flux-framework/flux-operator#63.
Refactor docs design to match new design decision for Flux projects!
vsoch open issue mmore500/presentation-template#5.
Rotate credentials
Heyo! Just wanted to check that you rotated the circle credentials - I can see docker credentials (not the credentials just that they are there) and over xmas break Circle announced we should do this because apparently they had an incident….View Comment
vsoch open issue NeuroVault/NeuroVault#769.
Rotate credentials for Circle CI
Heyo! Just wanted to check that you rotated the circle credentials - I can see docker credentials (not the credentials just that they are there) and over xmas break Circle announced we should do this because apparently they had an incident. …View Comment
vsoch open issue AICONSlab/MIRACL#83.
Check for rotating circle credentials
Heyo! Just wanted to check that you rotated the circle credentials - I can see docker credentials (not the credentials just that they are there) and over xmas break Circle announced we should do this because apparently they had an incident….View Comment
vsoch open issue bashtage/sphinx-material#129.
High contrast color scheme option
Hi there! We are deploying this template (love it btw, thank you!) to some of our sites, and are worried that the colors are a bit light. We see there is an issue open for a dark mode #89 (which would be amazing) and we are also wondering if there could be modes / options for higher contrast colors? E.g., here is an example from my group to compare traditional sphinx to the defaults here: …View Comment
vsoch open issue StanfordVL/iGibson#255.
AWS tokens in releases
In case you didn’t see this - I think this is your project? …View Comment
vsoch open issue LLNL/radiuss#54.
Support for dark mode
Akin to Flux’s! https://flux-framework.org…View Comment
vsoch closed issue spack/spack#34818.
All builds failing "no module named macholib"
Tonight all our flux builds failed with …View Comment
vsoch closed issue rseng/jobs-updater#7.
Allow user to specify an existing file
For hpc.social, we want the jobs bot to run right after an update, and before a commit is made. Even if we made the commit, a bot cannot trigger another bot! So our script will generate an older and updated file, and then provide them to the updater to trigger on the same job that is doing the update. I’ll do this tomorrow, then add the workflow to https://github.com/hpc-social/jobs….View Comment
vsoch open issue weaveworks/eksctl#6116.
[Feature] elastic fiber adapter (efa) from the command line
vsoch open issue converged-computing/flux-cloud#5.
add parameter for number of times to run (defaults to 1)
e.g., we likely want to get variance / std for a run, and especially if it’s quick (e.g., lammps is 20 seconds) it makes sense to run more than once!…View Comment
vsoch open issue flux-framework/flux-security#159.
[docs portal] repository needs tags!
See …View Comment
vsoch open issue flux-framework/flux-pmix#75.
[docs portal] repository needs tags!
See …View Comment
vsoch open issue flux-framework/flux-accounting#305.
[docs portal] repository needs tags!
See …View Comment
vsoch open issue vsoch/uptodate#62.
Issue parsing commits data structure
This needs to be debugged https://github.com/rse-ops/docker-images/actions/runs/3797808483/jobs/6459048819. It’s infrequent so I’m not going to jump on it asap….View Comment
vsoch open issue flux-framework/flux-operator#58.
flux operator submit library
I’m working on a small set of scripts that will make it easy to bring clusters up/down for different clouds (starting with GCP for our experiments) and I’m not sure where to put it yet - since I can’t make repos here I’ll likely put it under rse-ops for the time being. It might have integration with cloud-select so I might also put it under the converged computing organization. Some prototype thinking (shared on slack): …View Comment
vsoch open issue flux-framework/flux-docs#182.
comparison table needed front and center!
Re: https://twitter.com/multimeric/status/1608314794771353601?s=46&t=TpURSWMUAkzhMLATTQF1UQ …View Comment
vsoch closed issue singularityhub/shpc-registry-cache#2.
Allow to run for few weeks, when comfortable, direct push
vsoch closed issue converged-computing/cloud-select#19.
Add exponential backoff to API calls for aws
E.g., catch this: …View Comment
vsoch open issue rook/rook#11481.
[question] integration with another operator
Hiya! I think I could open this on the slack, but since it’s the holiday I thought this might be better to catch someone in the new year. Basically, I’m interested in a storage solution for an operator I’m developing for Flux, meaning we can run HPC-type workloads in Kubernetes https://github.com/flux-framework/flux-operator. I’ve mostly been doing local development with MiniKube, and of course in this context a local host volume works fine, but today I tried for the first time to deploy to GKE, and the persistent volume claim (that needs to be ReadWriteMany) of course failed! I’m looking into robust solutions to deploy storage (ideally controlled by our operator to avoid complexity) and most of what I’ve seen talks about setting up NFS https://medium.com/@Sushil_Kumar/readwritemany-persistent-volumes-in-google-kubernetes-engine-a0b93e203180. I saw a suggestion for Rook here https://discuss.kubernetes.io/t/how-to-setup-a-readwritemany-and-readwriteonce-storageclass/12097 and while it looks like this is an operator in and of itself (which might not be ideal to require two operators) I thought it might be that someone else had come here with a similar need and I’d open up an issue for discussion! Is there some use case for rook, or an underlying volume type / service that rook uses, that we could use in the Flux Operator? Thanks for your wisdom - and apologies if this is super noob question I’m new to developing for Kubernetes (and really loving it so far!)…View Comment
vsoch open issue spack/spack#34689.
spack install texinfo fails
I am getting this error: …View Comment
vsoch open issue converged-computing/cloud-select#19.
Add exponential backoff to API calls for aws
E.g., catch this: …View Comment
vsoch closed issue urlstechie/urlchecker-action#99.
fake_useragent error
Hey guys! …View Comment
vsoch closed issue pydicom/deid#249.
Possible incorrect coordinate definition
While reviewing the default deid.dicom file I noticed one greylist filter entry which I believe was incorrectly changed from coordinates to ctpcoordinates. …View Comment
vsoch open issue cbcrg/nf-benchmark#31.
Question about use cases!
Hiya! I’m wondering if this is a good module to use to benchmark the same workflow in different contexts? Or is it intended to give stats for a particular workflow regardless of location? I was looking for benchmarking workflows in nextflow and having some trouble finding them - but maybe a ML workflow with something like this could work? :thinking: …View Comment
vsoch closed issue flux-framework/spack#61.
[package-update] for flux-core: update-package/flux-core-2022-12-12
This is a request to open a pull request for a package update….View Comment
vsoch open issue rse-ops/nix#13.
Explore idea of GitHub packages nix cache
vsoch open issue flux-framework/spack#56.
flux-core build failing tonight
Will look into tomorrow. https://github.com/flux-framework/spack/actions/runs/3616888046/jobs/6096151257#step:18:1009…View Comment
vsoch open issue DavHau/mach-nix#526.
Help adding bioconda channel
Hi! I’m trying to add bioconda to my flake.nix, but this fails: …View Comment
vsoch open issue nix-community/nur-combined#8.
Issue with server on heroku?
Hiya! I updated some repositories last night and I think there might be an issue pinging NUR here? E.g.,: …View Comment
vsoch open issue rse-ops/devbox#7.
Add docs for adding a nur repository to devenv
See https://github.com/cachix/devenv/tree/main/examples/nur…View Comment
vsoch closed issue hpc-social/hpc-social.github.io#49.
Add our podcast?
Hi– I’m Sarah Webb at the Krell Institute. We just joined the community as @sciencemachines. …View Comment
vsoch closed issue XiaohuaZhangLLNL/conveyorlc#5.
New Release?
Hiya! I’m trying to make this into a nix package, and ideally I could do that from a release! I noticed there hasn’t been a release since 2018 - would it be possible to draft one soonish? :thinking: …View Comment
vsoch open issue USRSE/blog#38.
CircleCI Workflow update to GitHub Actions
Hi USRSE! The repository here is deploying via my bot’s token, and I can no longer provide this and want to suggest you update to perform the push in GitHub actions. Don’t worry - I already wrote the workflow: https://github.com/hpc-social/blog/blob/main/.github/workflows/update-posts.yaml just need someone to add it here. I’ll revoke my bot permissions at the end of the week, but let me know if you need more time. Thanks!…View Comment
vsoch closed issue hpc-social/hpc-social.github.io#1.
Preview for site - Netlify?
@alansill I’m a huge fan of Netlify for previewing sites - I have my own free account but I wonder if HPC social could have it’s own free account? https://www.netlify.com/pricing/ The interactions with GitHub issues / PRs are seriously beautiful - I tested on US-RSE but couldn’t keep it because our build frequency (e.g., common updates to jobs) would go over the free tier. …View Comment
vsoch open issue nextflow-io/nextflow#3429.
Bug translating nextflow.config nested "flux" variables into workflow
## Bug report …View Comment
vsoch open issue nextflow-io/nextflow#3419.
Flux output warning
## Bug report …View Comment
vsoch open issue LLNL/conduit#1041.
Simple CMakeLists.txt
Hi! I’m looking to build this with nix, and the way you have the custom install (running the bash script) is not ideal (it largely won’t work). Can you share the most vanilla method to get this built with a standard cmake command? Apologies I’m not a heavy cmake user so I don’t really know how it works. Thanks!…View Comment
vsoch closed issue singularityhub/shpc-guts#5.
How about Debian ?
…just noticed this! :-) …View Comment
vsoch closed issue snakemake-workflows/rna-seq-star-deseq2#56.
Data availability?
Hi! I’m using this as a test workflow for an orchestration tool, and I’m not familiar with the workflow itself and am looking for some dummy data so this doesn’t happen: …View Comment
vsoch open issue flux-framework/flux-operator#44.
Add flux to nextflow
see https://github.com/nextflow-io/nextflow/issues/3403. …View Comment
vsoch open issue snakemake-workflows/rna-seq-star-deseq2#56.
Data availability?
Hi! I’m using this as a test workflow for an orchestration tool, and I’m not familiar with the workflow itself and am looking for some dummy data so this doesn’t happen: …View Comment
vsoch open issue nextflow-io/nextflow#3403.
Flux Framework executor support
## New feature …View Comment
vsoch open issue flux-framework/flux-restful-api#31.
User interface needs a cancel button!
Should be on the job info page, and possibly also in the table (if the job is still running)….View Comment
vsoch closed issue flux-framework/flux-restful-api#21.
Ensure we have emojis and colored buttons for every job state / result
I think I just check for INACTIVE and COMPLETED and the rest are just rendered as blue - my lazy :butterfly: needs to look at the Flux docs and just get the entire list and get this done….View Comment
vsoch open issue flux-framework/flux-operator#42.
Add shellcheck to pre-commit
See https://github.com/koalaman/shellcheck-precommit…View Comment
vsoch open issue flux-framework/flux-restful-api#23.
Multi-user case
If anyone can articulate the design for flux for the multi-user case, we can have a discussion about what that means for the server here (e.g., a user login based on PAM, LDAP, OAuth2 or similar that maps to a cluster resource)….View Comment
vsoch open issue flux-framework/flux-restful-api#22.
Try playwright for automated tests of browser endpoints
https://playwright.dev/python/docs/test-runners …View Comment
vsoch open issue flux-framework/flux-restful-api#19.
Environment variables to add for config
- FLUX_HAS_GPUS
should be set if the cluster supports asking for gpus_per_task. If not allowed, any requests that ask for GPUs should respond with a meaningful error message.
…View Comment
vsoch open issue spack/spack#33820.
Gnome download urls for libxml2 are borked
You can see the failure here: …View Comment
vsoch open issue jetpack-io/devbox#267.
Adding a NUR (Nix User Repository!)
Hi! I’d like to install packages from a NUR: https://nur.nix-community.org/repos/rseops/ and just doing devbox add
didn’t work.
…View Comment
vsoch closed issue weaveworks/grafanalib#538.
Generate dashboard json from inside Python
Hi! I’m new to this project and Grafana, and I’m wondering if it’s possible to reproduce: …View Comment
vsoch open issue weaveworks/grafanalib#538.
Generate dashboard json from inside Python
Hi! I’m new to this project and Grafana, and I’m wondering if it’s possible to reproduce: …View Comment
vsoch open issue grafana/grafana-csv-datasource#223.
How to configure in conf/provisioning/datasources
Hi! I’m new to setting up grafana so apologies if this is detailed somewhere. I’ve been able to manually set up the datasource in the GUI, but I’d like to have it automatically done on container start. I know I can do this via a yaml file in conf/provisioning/datasources
but it’s not clear to me how a local path would be defined in this space. Could docs be added for this somewhere? Thank you!…View Comment
vsoch open issue rse-ops/rajaperf-tutorials#3.
refactor Dockerfile to use spack.yaml
The build is locally working but not in the CI? My current setup is a bit of a hack, I’m going to try using spack as a more “streamlined” way of doing it (we hope!)…View Comment
vsoch closed issue rse-ops/ci#17.
Actions here need updates to set-output, etc.
I’ll need to add a new parser type to the action updater!…View Comment
vsoch closed issue vsoch/action-updater#17.
Handling of non-zero return code
https://github.com/vsoch/action-updater/blob/2c2216e27ee963aaa31ff0ff81de007acb84b9c2/action.yaml#L67 …View Comment
vsoch closed issue singularityhub/guts#7.
Tar - skip /dev (permissions denied, of course)
See https://github.com/singularityhub/shpc-registry-cache/actions/runs/3312246902/jobs/5468685611…View Comment
vsoch closed issue singularityhub/singularity-hpc#550.
Code Quality
I think after we have the following PRs reviewed and merged: …View Comment
vsoch open issue iyaja/batch-job-submitter#1.
Question: where does this run?
Hi! Where is this expected to run? I assume sbatch doesn’t exist on a GitHub provided runner - are you using somewhere with a GitHub runner that has slurm integration? …View Comment
vsoch open issue singularityhub/sregistry#407.
idea: spec out idea for builder operator
vsoch closed issue flux-framework/spack#32.
check for new releases should be part of spack updater
instead of requiring a script here, this should be part of the spack updater action….View Comment
vsoch closed issue expfactory/expfactory#160.
Token preemptively completing experiments...
## Version of Experiment Factory: …View Comment
vsoch open issue google/go-cloud#3171.
[question] Cross cloud deployment (without terraform?)
Hi! I was attracted to this project because it looks like you have nice interfaces to communicate with storage (and a few other flavors of things) across clouds. However, for deployment of something to an instance (e.g., GCP compute engine vs. AWS EC2) it looks like you fallback to using terraform (in your guestbook example): …View Comment
vsoch open issue flux-framework/flux-operator#31.
Support > 1 container in CRD
Double rainbow :rainbow: or double container? :thinking: ? …View Comment
vsoch closed issue singularityhub/singularity-hpc#568.
shpc load
vsoch open issue tunel-apps/tunel#31.
Test podman run into an interactive node
We’d want to be able to launch an interactive node and shell into a podman container with –userns and overlay active so it “feels like” they have root….View Comment
vsoch closed issue flux-framework/spack#24.
[package-update] request to open pull request.
This is a request to open a pull request for a package update….View Comment
vsoch closed issue syspack/pakages#20.
Bug with spack listdir - should only include directories with package.py
vsoch closed issue sciworks/spack-updater#35.
Check if an issue is already open before opening another one
vsoch closed issue danger89/fake-useragent#9.
Server issues?
Heyo! I was attempting to install this updated version, and I think there is either a bug or I’m erroneously using the old repository. Here is the bug: …View Comment
vsoch open issue vsoch/lolcow-operator#3.
Bugs I need to figure out
- [ ] ensure we check if a service is already running on the port, and don’t recreate …View Comment
vsoch open issue tunel-apps/tunel#25.
Should be a way to provide secrets for apps
vsoch closed issue rse-ops/ci#11.
Add a reference to Radiuss-Shared-CI
## Add Reference to: …View Comment
vsoch open issue tunel-apps/tunel#19.
Take a look at stress-ng
https://github.com/ColinIanKing/stress-ng…View Comment
vsoch open issue snakemake/snakeface#25.
Refactor to work with tunel
I think this will be better scoped as a single container app, and one that can be deployed easily to HPC with tunel: https://vsoch.github.io/tunel. Going to start working on this today! …View Comment
vsoch open issue coder/code-server#5423.
[Docs]: Documentation for unix socket deployment?
Hi! I’m not sure if this is possible because I can’t find it - but I’m looking to run the container and instead of specifying a hostname/port, I want to specify a --socket
instead (akin to how jupyter notebook does it). Is there support/documentation for this? The use case is running on remote servers that don’t allow you to map ports (sockets work here!) Thank you!…View Comment
vsoch open issue vsoch/tunel#17.
Custom templates needed (wanted)
- [ ] django …View Comment
vsoch open issue pydicom/deid#210.
How to de-identify a `pydicom.Dataset`?
Hi, …View Comment
vsoch open issue singularityhub/singularity-hpc#568.
shpc load
vsoch open issue lvc/abi-compliance-checker#118.
Docker Container?
Hi! …View Comment
vsoch closed issue buildsi/spliced#1.
Create GitHub action to run splices
And use in spack-splices…View Comment
vsoch open issue buildsi/spliced#15.
Vanessa TODO
- [ ] Check for symlinks in elfcall https://github.com/vsoch/elfcall/issues/7 …View Comment
vsoch open issue midjourney/docs#21.
Nat Friedman not CEO of GitHub anymore!
Hi there! …View Comment
vsoch open issue rseng/software#292.
Consider refactor to use jekyll web instead
or just make a new repository and link to it….View Comment
vsoch open issue rseng/rse#66.
Additional scrapers to add
- [x] ascl https://ascl.net/code/all/limit/250 …View Comment
vsoch open issue rseng/rse#60.
Given custom tags or taxonomy...
we can add a custom circle plot to show some kind of relationship in the jekyll-web template. https://codepen.io/samsonite123/pen/BxoxYd. …View Comment
vsoch open issue vsoch/tunel#15.
Try shiny example
https://community.rstudio.com/t/using-shiny-with-unix-domain-sockets/66081/4…View Comment
vsoch open issue rseng/rse#59.
Extended rse software for custom UI and analysis
We discussed today being able to make a more customized (and simpler) UI for a software database, and I think this should be supported with rse export jekyll-web /docs
. Specifically we also want:
…View Comment
vsoch open issue oras-project/oras-py#40.
Convert rst to markdown :)
vsoch open issue vsoch/tunel#12.
Add HTCondor Launcher
vsoch open issue vsoch/cle#17.
Aggregate locations from eightbytes need to "bubble up"
vsoch open issue selwin/python-user-agents#119.
Generation of user-agent or just parsing?
Hi! Does this library allow random generation of a valid user agent string, or is it just for parsing them? Thanks!…View Comment
vsoch closed issue vsoch/docsy-jekyll#23.
Question: Does this theme support versioning?
Understanding that software moves through versions over time, does this theme support having different versions of the docs?…View Comment
vsoch open issue vsoch/tunel#6.
Try out proxy jump
see https://www.redhat.com/sysadmin/ssh-proxy-bastion-proxyjump…View Comment
vsoch open issue vsoch/tunel#4.
add rich for better colored output!
vsoch closed issue jupyterlab/jupyterlab#12731.
Support for unix socket
### Problem …View Comment
vsoch open issue jupyterlab/jupyterlab#12731.
Support for unix socket
### Problem …View Comment
vsoch closed issue singularityhub/singularity-hpc#453.
Naming of the module.tcl files?
Is there a reason why the module files are all named module.tcl in the software/version directory (first screenshot) instead of named as the version in the software directory (second screenshot)? The latter is the naming convention on all the HPC environments I have used to date. …View Comment
vsoch closed issue spack/spack#31152.
superlu-dist error when trying to install petsc
Using gcc 12.1, I get: …View Comment
vsoch open issue oras-project/oras-py#38.
Remove redundant docs
I have about 1-2 days more worth of checks, but after that we can remove redundant docs from oras-py here as they are represented under https://github.com/oras-project/oras-www. I will likely tweak the python page there retrieved from here) so it has links to the more detailed developer docs/notes here….View Comment
vsoch open issue libabigail/libabigail#3.
Automation Needed
- [ ] Clone and update main branch here with upstream …View Comment
vsoch open issue spack/spack#31152.
superlu-dist error when trying to install petsc
Using gcc 12.1, I get: …View Comment
vsoch open issue oras-project/oras-py#37.
Provider decision to generalize or simplify
The current decision to have a provider Register with generic push/pull and args/kwargs is because I wasn’t sure about the design wanting to be moved toward the “copy” way, where we have a generic idea of a provider that has the same interactions. Moving forward, we have two choices (and it would be good to discuss to come to a decision): …View Comment
vsoch open issue vsoch/qme#44.
Error message is misleading
https://github.com/vsoch/qme/blob/72b3ded78566926083314e8690fd11123d31af21/qme/client/start.py#L25 …View Comment
vsoch open issue LLNL/radiuss#38.
Broken dependency chain llnl-scraper -> github3
I am hopefully working on a fix for this! The details are here: https://github.com/LLNL/scraper/pull/59 and the fix here: https://github.com/LLNL/scraper/pull/60 and when that is released I’ll finish the PR here https://github.com/vsoch/contributor-ci/pull/22 that will be used for the action here. …View Comment
vsoch open issue vsoch/cle#2.
Examples to add
Currently we have the examples under ./examples and we should review a list to add (corresponding to tests)….View Comment
vsoch open issue psychedelhic/csscleaner#2.
Did you ever write this?
Hiya! I was looking up the name “csscleaner” on pypi and I found this project, which hasn’t had activity for almost 10 years? Did y’all ever write this? Were you just name squatting? I’m going to write a tool that does this in Python and I’d be interested in the name if you aren’t really using it. Thanks!…View Comment
vsoch open issue vsoch/cle#1.
Finish parsing with "smeagle model"
We could just use location lists, but “THERE IS NO OTHER WAY!” so I am working on… that :) …View Comment
vsoch open issue oras-project/oras-py#33.
Consider HEAD to blob before upload
Currently we just try pushing a blob and don’t check if it already exists. I tried adding this to #32 but hit some bugs that I didn’t have bandwidth to work through, so I’m creating an issue as a reminder this is worth trying again. …View Comment
vsoch open issue mamba-org/boa#281.
Getting Started Noob Questions
heyo! I’m following the README to test things out, basically I want what would be considered a “getting started” set of steps, and I am a good test user for this because I’m not totally familiar with the details of mamba. So first I tried cloning and installing: …View Comment
vsoch closed issue oras-project/oras-py#14.
Look into testing with authentication
For just login/logout, since we can’t deploy an auth server easily….View Comment
vsoch open issue cncf/foundation#352.
DCO Policy for bots?
Hey CNCF! …View Comment
vsoch open issue MillionConcepts/software_project_management#1.
This is so cool!
hey @cmillion ! I love this! Is there an opportunity to make some kind of template, either a web interface you can export from or even just a template repository with a simple set of sections that pretty render into a project document on pages. It could even be a combination of that and then pre=populated Google sheets, etc. We could make it as a project in https://github.com/rseng if you are interested. Thoughts?…View Comment
vsoch closed issue pydicom/deid#200.
Keeping up-to-date with CTP
Hi …View Comment
vsoch closed issue angr/angr#3125.
Simple example to parse metadata about functions for a binary
Hi angr team! I’m looking to do some simple binary analysis to get a listing of functions, and for each: …View Comment
vsoch open issue oras-project/oras-go#147.
Support for retry?
hey oras developers! …View Comment
vsoch open issue buildsi/spliced#12.
Updates for spliced and smeagle
- spliced: add comment function for pyclingosolver (I can do this) …View Comment
vsoch open issue saferwall/elf#2.
Question: difference between this and debug/elf?
Hiya! I’m interested in parsing ELF with Go (and I’ve used debug/elf a bit) and I’m wondering what are the main differences here? Thank you!…View Comment
vsoch closed issue google/go-containerregistry#1275.
question: Can crane get config for oras artifact on GitHub packages?
This isn’t urgent because (at least for now) I don’t need a config for this particular container! But I was playing around, and trying to …View Comment
vsoch closed issue LLNL/radiuss#34.
Reconcile which repos belong here
Comparing sources of truth: …View Comment
vsoch open issue vsoch/citelang#28.
Ensure R packages don't parse out .
There are some packages with a . in the name: …View Comment
vsoch open issue pydicom/deid#207.
Add deid provided functions
Discussion here: https://github.com/pydicom/deid/pull/203#issuecomment-1071870150 …View Comment
vsoch closed issue autamus/cuppa#3.
Docker update not considering date
It looks like for latest we are just comparing the strings semantically and not considering the order in which they are added? https://github.com/autamus/cuppa/blob/0911790f92cef38d9fe19e724b93d8a09dc47e48/providers/docker/containers.go#L50-L65 …View Comment
vsoch open issue ossf/scorecard-action#150.
Questions about branding and hash pinning
Hey scorecard maintainers! So I just watched this video and haven’t tried out the action yet, but I wanted to ask some questions about / discuss some of the things mentioned there. …View Comment
vsoch open issue github/codeql#8502.
Running codeql on prebuild-container
Description of the issue …View Comment
vsoch open issue rse-ops/docker-images#88.
changes to spack external find?
vsoch open issue USRSE/usrse.github.io#755.
Daylight savings time
There seems to be a brief time when UTC daylight savings hasn’t started (but ours has?) until March 27th where the hours are going to be off by one. This is super annoying, yes, and we can decide if it’s worth trying to fix. See the discussion here: …View Comment
vsoch open issue spack/spack#29434.
spack uninstall with --force should not ask me for confirmation
```bash …View Comment
vsoch open issue LLNL/radiuss#24.
XBraid/xbraid
- [ ] docs branding …View Comment
vsoch open issue rse-ops/rse-ops.github.io#10.
Design of rse-ops: docs vs. projects
@alecbcs had some good feedback that the portal (and then directing to /home) is kind of awkward, e.g., here: https://rse-ops.github.io/home/. I think I’d like to make it better - we want rse-ops to be a nice site! @davidbeckingsale and @alecbcs do you want to discuss what we can make better (design and organization wise) and I can start working on it? …View Comment
vsoch open issue rse-ops/docker-images#81.
hip failing: meson no such file or directory
vsoch closed issue singularityhub/singularity-hpc#383.
Allow for script definition instead of alias
Describe the bug …View Comment
vsoch open issue TreeTide/underhood#69.
codeunderhood.com still online?
Hi! I wanted to check out codeunderhood.com but it’s just infinitely spinning (and I suspect will time out). Is this a temporary thing or is it permanently offline? If the latter, do you have a container build that would make it easy to preview locally, or someone using it that I could look at? Thanks!…View Comment
vsoch open issue online-ml/chantilly#24.
Update metrics - possibly missing a case?
heyo! So I am using the same logic as chantilly after I’ve hit the learn endpoint and want to update metrics. Here is the basic logic refactored into its own function: …View Comment
vsoch closed issue online-ml/river#856.
model.to_dict()?
Hiya! I am adding a view to quickly return some summary for a model, and I’m wondering if there could be some kind of model.to_dict(). E.g., it looks like some basic view of this could be unwrapped? …View Comment
vsoch open issue online-ml/river#856.
model.to_dict()?
Hiya! I am adding a view to quickly return some summary for a model, and I’m wondering if there could be some kind of model.to_dict(). E.g., it looks like some basic view of this could be unwrapped? …View Comment
vsoch closed issue singularityhub/github-ci#12.
container-builder-template link broken in README
vsoch closed issue singularityhub/singularity-hpc#488.
Release updating
I follow GitHub for your latest release: https://github.com/singularityhub/singularity-hpc/releases …View Comment
vsoch open issue LLNL/apollo#19.
Update description
hey apollo team! I think it would be good to come up with a slightly tweaked description for the project, because it technically isn’t online (incremental) learning. I was chatting in Mattermost with @davidbeckingsale about it. I don’t know the project well enough to make a suggestion - but is it some kind of batch ML server perhaps?…View Comment
vsoch closed issue online-ml/river#834.
Question: does it handle concurrency?
Hi! I’m browsing around to learn about real time machine learning and I stumbled on this repo. You mention using dicts for data, and also have an example of a flask app to update a mode l- https://riverml.xyz/latest/user-guide/reading-data/ but does river handle concurrency? E.g., I’m trying to figure out what best practices are for making a production server with the main purpose of receiving new data and updating a model. The logical thing is to keep some queue of new entries and run a batch update overnight, but I’m hoping there are more sophisticated “real time” methods that somehow map the model to the database. Thanks!…View Comment
vsoch open issue martialblog/word_embedding_storage#8.
question: storing model in database?
Hi @martialblog ! Have you given any thought to (or know about) efforts to store not just the vectors, but the model itself in a database? I’m just starting to explore this idea and was doing searching and came across your repo here. Thank you!…View Comment
vsoch closed issue buildsi/Smeagle#66.
Consider "unknown" or "unparsed" type
We currently leave out a parsing if we’ve seen it before. This approach would instead add an “unknown” type (or similar named, something along the lines of being left out) instead….View Comment
vsoch open issue rseng/rsepedia-analysis#1.
Next steps to try
I want to improve the quality of the current models, so I want to re-run the current and also parse Rmd, rst, and Rd. …View Comment
vsoch closed issue containers/podman#12688.
CreateInit is public but requires a private cobra cmd
Hiya! So I’m working on integrating podman into one of my applications, and I hit a blocker when trying to do a run, at this point: …View Comment
vsoch open issue vsoch/elfcall#6.
Make logo, branding, and docs
vsoch open issue rseng/software#259.
Redo UI with data table
It’s really too big for the old layout!…View Comment
vsoch closed issue USRSE/gallery#2.
Issue templates
I clicked on “Open Blank Issue” and it took me to USRSE/usrse.github.io issues. For this site we probably want a subset of the current template choices (or a slight modification to those here: https://github.com/USRSE/gallery/blob/main/.github/ISSUE_TEMPLATE/config.yml)…View Comment
vsoch closed issue zenodo/developers.zenodo.org#34.
"params" does not work in example
hey Zenodo! I’m not sure if this has changed over time, but the example that uses “params” doesn’t seem to work (I just tested for this particular use case) …View Comment
vsoch closed issue mbaykara/k8s-cluster#2.
Quick question: connection refused second node
hey @mbaykara ! I have some experience deploying apps, but am relatively new to admin k8s. I have one external server and decided VMs (Vagrant) would be a good avenue - and that’s how I stumbled on your post! Everything is going smoothly so far - I have my main node running and one worker registered, but for the second worker: …View Comment
vsoch closed issue rse-ops/docker-images#51.
Output in workflow != local
@davidbeckingsale I updated uptodate with a fix for the exclude matrices, but when I pull the container I can’t reproduce the output in the job https://github.com/rse-ops/docker-images/runs/4682583744?check_suite_focus=true e.g., the excludes don’t seem to be excluded there? Could you take another look / be another set of eyes to see if maybe I am just not seeing something - did I add the exclude to the right file?…View Comment
vsoch open issue pakages/zlib#2.
Steps Needed for Automation
- [ ] add binoc to detect a new version of a package.py …View Comment
vsoch open issue rse-ops/docker-images#49.
Bugs to fix
It looks like: …View Comment
vsoch open issue vsoch/spack-package-action#7.
Try turning into a library proper
And then allowing spack to install from it….View Comment
vsoch open issue spack/spack#27985.
spack test run caliper fails (missing example)
```bash …View Comment
vsoch closed issue spack/spack-monitor#21.
Updates to Spack Monitor
Please write all issues that should be addressed for the next round of fixes to spack monitor! …View Comment
vsoch open issue firekccz3/singularitywarea#1.
Commit date
Hey! I think you have an issue with your commit dates - they are years into the future …View Comment
vsoch open issue vsoch/codestats#3.
Create organization summary metrics
maybe a different tab in the UI?…View Comment
vsoch open issue openPMD/openPMD-api#1153.
Build Cache!
hey @ax3l I have the workflow ready to deploy to the build cache here! You can see the PR against my repo here: https://github.com/researchapps/openPMD-api/pull/1 and the workflow running here: https://github.com/researchapps/openPMD-api/runs/4396635238?check_suite_focus=true …View Comment
vsoch open issue spack/spack#27741.
mpich missing dependency?
### Steps to reproduce…View Comment
vsoch open issue spack/spack#27667.
Missing variant in package cbtf-krell
```bash …View Comment
vsoch open issue spack/spack#27652.
Install with cmake (zlib) fails
### Steps to reproduce…View Comment
vsoch open issue buildsi/spack-splices#3.
Failures to record / various notes
- concretization failures …View Comment
vsoch open issue buildsi/spack-monitor-analysis-containers#3.
Create data structure "static API" to show contents
It would be nice to know (in advance) what the hashes are for a container - so either we can label on the fly (probably not ideal to lose arches) or we can create some kind of an API / method to get this information for a container. E.g.,: post build hook that can run, sniff the contents with a script, and save somewhere….View Comment
vsoch open issue autamus/builder#2.
Spack Label Schema
hey @alecbcs ! I am building some base containers for spack, and I was thinking we should design a label schema, e.g., right now it’s common to see: …View Comment
vsoch open issue JasonYangShadow/lpmx#28.
Related tools
Hi! I saw your repository and took a look - for related tools you can also include Singularity Compose, which does actual orchestration for Singularity containers (volumes, networking, etc.). Thanks!…View Comment
vsoch open issue buildsi/spack-monitor-analysis#5.
Look into caching
once we have builds going, we won’t need to upload binaries we’ve already done for some version/os/arch, so a cache will be useful! See https://github.com/actions/cache…View Comment
vsoch closed issue singularityhub/github-ci#2.
update recipe to use oras push/pull
https://github.com/singularityhub/github-ci/blob/713df0b3b1d7f97aba421210ae297a343227e80e/.github/workflows/native-install.yml#L36…View Comment
vsoch open issue rse-ops/roadmap#10.
Community initiatives
- [ ] Halloween post …View Comment
vsoch closed issue singularityhub/docker2singularity#106.
hit a panic: runtime error
Hello: …View Comment
vsoch open issue vsoch/usrse.github.io#128.
Double check jobs
It’s rendering in my local testing, but for some reason not on the current site: https://vsoch.github.io/usrse.github.io/jobs/. Let’s keep an eye on it….View Comment
vsoch open issue spack/spack#26644.
Clingo bootstrap "cannot import name architecture"
### Steps to reproduce…View Comment
vsoch open issue vsoch/puzzles#4.
Refactor to handle an image
Working on this now!…View Comment
vsoch open issue rse-radiuss/roadmap#8.
RADIUSS Documentation Templates
If we can create common (branded) templates for readthedocs (and other common means to provide documentation on GitHub pages) it would be possible to link to any project’s docs from the main radiuss site and not feel like you transitioned to a totally different thing….View Comment
vsoch open issue rse-radiuss/docker-images#37.
Pull Request Trigger
We need a better way to trigger PRs, e.g., ONLY build matrices where the uptodate.yaml is changed….View Comment
vsoch closed issue sylabs/singularity#338.
#error Minimum supported version of Libseccomp is v2.2.0
Hi Sylabs! I’m trying to update the singularity-orb for CircleCI to have newer versions (and the new path) and I’m running into an error I’m hoping you might have seen before, or can help with: …View Comment
vsoch open issue rse-radiuss/roadmap#4.
Interactive CI Badge
We would like to consolidate the Contributor CI and current policy points to some final set of points, and then have an interactive “generate your own badge” page on the RADIUSS site….View Comment
vsoch closed issue singularityhub/singularity-hpc#439.
Thoughts on the extended syntax for aliases
I have realised one thing by looking at the alias here: …View Comment
vsoch open issue ConradIrwin/go-dwarf#1.
Question about Functions and Types
hey @ConradIrwin ! I’m exploring debug/dwarf and found your library, and was interested to ask you some questions that maybe you stumbled into while working on it. …View Comment
vsoch open issue rse-radiuss/docker-images#32.
Bug in Docker bases
https://github.com/rse-radiuss/docker-images/runs/3670397836?check_suite_focus=true …View Comment
vsoch open issue buildsi/Smeagle#90.
Dyninst segfaults with union
If we have an empty union, Dyninst segfaults …View Comment
vsoch closed issue vsoch/uptodate#42.
Add isEmpty variable
Instead of doing the check and setting in an extra step, the action should be able to do the check and set for the runner….View Comment
vsoch closed issue intel/yarpgen#151.
Question about testing ABI
Hi there! I really like your testing framework, and since you have experience is random program generation was wondering if I could ask you advice for doing something similar for testing application binary interface compatibility. E.g.,: …View Comment
vsoch open issue autamus/binoc#14.
A few bugs to look into (binoc or lookout?)
Just want to consolidate these here! …View Comment
vsoch open issue vsoch/uptodate#40.
Tags are needed
Each generation of a new container (Dockerfile or Docker Build) should include a list of tags that we can return (and then pipe into an interface!)…View Comment
vsoch open issue singularityhub/singularity-catalog#8.
Pin Python version
I had the same issue with the spack stack catalog about not being able to import from __futures__
and I think it’s because Python is < 3.7. I’ll try the fix over there and move over here if it works….View Comment
vsoch open issue rse-radiuss/rse-radiuss.github.io#2.
Create page that showcases containers?
Or maybe it should be served from docker-images? Will think about it….View Comment
vsoch closed issue rse-radiuss/rse-ops#11.
Additional pages to add
In addition to the current content, we would want to add pages for: …View Comment
vsoch open issue matryer/gopherize.me#31.
Development Instructions?
Hi! I was hoping to play around with the source code, but when I go into gae and type go run main.go
I find that (for some reason) it cannot find functions that are in main (but I clearly see them):
…View Comment
vsoch open issue vsoch/uptodate#6.
Output of action should include json that can be used for build matrix
And should include paths to the updated files….View Comment
vsoch open issue archspec/archspec-go#4.
Planning to write detect / other functions?
Hey Archspec, …View Comment
vsoch open issue autamus/binoc#10.
Add parser for Dockerfile
This should be fairly easy to implement, but I’m planning of having a large repository of automated builds and Dockerfile for the different ones, and I’d like the FROM images to be kept up to date, and when there is a new version to have a PR open to test (just right up binoc’s alley, yay!!) so I think this should be fairly easy to do given that shpc already processes the docker URIs - I’ll have a first shot at it….View Comment
vsoch closed issue buildsi/Smeagle#18.
Empty fields should not be output
E.g., this …View Comment
vsoch open issue hpcng/singularity-userdocs#408.
Ensure that previous singularity.lbl.gov docs are reproduced
I’ve recently been going through old Google Groups and GitHub issues, and there is a surprising number of references to old tutorials, instructions, or content on the singularity.lbl.gov page. Currently, if the user clicks they are taken to the home of the hpcng user docs, which gives them no idea about the previous content, is confusing, and considered bad practice to have so many broken links. I’ve discussed this in slack, and there are a few options: …View Comment
vsoch open issue vsoch/rse-ops#3.
Add paper and convert latex to markdown
I want the sections of the paper to render alongside their respective categories on the site, so I will want to: …View Comment
vsoch open issue vsoch/ood-compose#6.
TODO: update to use modern slurm (20.11)
and try out new rest API!…View Comment
vsoch closed issue singularityhub/github-ci#5.
Typo in a link in README
I believe there are some links in the README aimed at a URL containing ‘workfolws’. …View Comment
vsoch open issue oras-project/oras-go#14.
Question about writerOpts in file.go
Heyo! I have a very naive question about the WriterOpts here: …View Comment
vsoch open issue vsoch/rse-ops#1.
Create yaml with categories and software
We will eventually want to render a landscape interface….View Comment
vsoch closed issue vsoch/docsy-jekyll#47.
Copyright, attribution, etc questions for my fork
Hi @vsoch , I am posting here to let you know that I have forked this awesome theme and am maintaining an unsynced repo. I have maintained heavy attribution and all your copyright notices and so forth, but that this point I think there is little left of your original code, and most of it is in the general layouts and templates. I expected to hue much closer to what you had, because I really liked it, but at this point I’m more concerned that your name is all over something that maybe doesn’t look much like your work. …View Comment
vsoch open issue vsoch/spack#28.
I have an issue with clingo
vsoch open issue containers/buildah#3376.
Dockerfile spec?
Heyo! Someone on the OCI list mentioned that there might have been an effort here (alongside the buildah project) to create a Dockerfile spec. Is that true? I was thinking it would be useful to have a vendor neutral spec for the Dockerfile (and associated things like a caching strategy)….View Comment
vsoch closed issue autamus/binoc#2.
Another use case for binoc!
hey @alecbcs ! I think I have another potential use case for binoc! I started a fun project this weekend which is basically going to be a small local (filesystem) registry of singularity containers for users. The users will access them via lmod, and an admin will manage them via a small client. For the first shot, I’m just going to have admins add their own containers, e.g.,: …View Comment
vsoch open issue spack/spack-bot#8.
Style and other commands for spackbot
from discussion with @tgamblin …View Comment
vsoch open issue opencontainers/distribution-spec#289.
Content Length Required
I wanted to open up some discussion about the Content-Length
header. Currently the spec does not say that all responses are required to have it, and this possibly makes sense because we could look it up, say, in the manifest, but since Content-Length is sort of bread and butter when it comes to communicating about requests/responses, is there any reason to not have it? Many libraries can find this field useful, so we are proposing to have it added to all registry responses that make sense for a future release:
…View Comment
vsoch closed issue vsoch/usrse.github.io#19.
Community Documents
https://vsoch.github.io/usrse.github.io/community-documents/ - I find this page confusing. I suggest starting with a list of community documents with short descriptions, and possibly their status? In progress, planning, draft done, etc.
…View Comment
vsoch open issue LLNL/scraper#58.
Use logging instead of printing
Heyo! I’m wondering if instead of having a bunch of print statements (that the user cannot control) …View Comment
vsoch open issue spack/spack#24552.
mpi not shown as a package with spack.repo.all_package_names
I’m not sure if this is a bug or the package is intentionally left out, but I noticed that mpi doesn’t come through in listing packages via spack.repo.all_package.names: …View Comment
vsoch closed issue vsoch/vsoch.github.io#24.
Just trying to help you update your blog!
I saw that you left Stanford for another opportunity. …View Comment
vsoch open issue spack/spack#24530.
Use Cases
As a supplement to https://github.com/spack/spack/issues/24529, it would be really useful to have a clearly defined “Use Cases” sections in the documentation. E.g., speaking to the use cases in the format …View Comment
vsoch open issue vsoch/contributor-ci#9.
Feedback on Pull requests plot
- what
vsoch open issue NERSC/shifter#299.
Shifter pull by digest
Hi! I’m hoping that shifter is able to pull by a digest (and not just a tag)? E.g., one of these I’m wanting to work: …View Comment
vsoch open issue vsoch/contributor-ci#3.
Additional extractors
It would be useful to be able to measure commits for a specific file, e.g.,: …View Comment
vsoch open issue buildsi/Smeagle#14.
Idea about test infrastructure
I was talking with @mplegendre about an idea for a testing setup for Smeagle. The basic idea is that if Smeagle can turn facts into assembly instructions, we could actually compare binaries, which is a better ground truth than hard coding some set of checks that we either have to painfully manually do (and get wrong) or just painfully manually do for many cases. …View Comment
vsoch open issue cea-hpc/modules#400.
Module alias not found in GitHub Workflow
Hi modules team! I hope it’s okay I’m posting here, I’d like to be able to link to a GitHub issue here and have it tracked in the PR. Basically, I have a PR: https://github.com/singularityhub/singularity-hpc/pull/373 that adds a test suite that installs and loads a module, and then uses one of the aliases. For some reason, I am always getting command not found. When I test the same series of commands in an actual cluster environment it works as expected, so I’m thinking that I’m perhaps forgetting to set some environment variable or similar so that the aliases are found? Note that I have two jobs that are failing in this manner - one that installs a singularity container module, and the other that installs a podman container module. I tried changing the syntax of the set-alias commands between the two just for testing, e.g., …View Comment
vsoch open issue buildsi/Smeagle#8.
Notes for (to be renamed) export/import
- if passed by value, imported …View Comment
vsoch closed issue USRSE/usrse.github.io#456.
Updates to events/navigation
vsoch open issue matomo-org/matomo#17622.
Ingest S3 directly
Hi matomo! I am wondering if there is a best practice for ingesting S3 logs directly, ideally from S3 and not needing to sync them to the same matomo server first and then using the script? Thank you!…View Comment
vsoch open issue spack/spack-stack-catalog#3.
bug: stack.yaml links aren't clickable if we go to second page
vsoch open issue matomo-org/matomo#17599.
docker-compose recipe?
Hi Matomo! I’m looking to get this running on an instance, and I’m wondering if anyone has set up a docker-compose recipe for it? I saw the docker-compose.yml in the .gitignore so I suspect that one used to exist, it exists locally for some developer. Any pointers to work started in this direction would be greatly appreciated! I’d definitely rather use containers than installing everything natively. If there is more development/help needed to create this recipe I could also offer. Thank you! …View Comment