Skip to content

Remote submit issues #40

Description

@jungerm2

I'm trying to follow the remote-submit example and keep running into an issue while connecting to the remote pool. Specifically, I get a "HTCondorException: Unable to locate local daemon." error.

I assumed this was because I didn't have condor installed locally and went ahead and installed it. This guide, as well as the docs here don't really make it clear if you need to install condor on your system to use the python library, so I'm unsure about this.

I'm on linux so I just followed the install procedure here which led to a "Error: Unsupported distribution Fedora". I'm not sure what makes Fedora incompatible with this install method, especially since redhat seems to be supported. Instead, I installed it via my package manager, and while the install succeded condor_schedd gave the following error:

ERROR "Cannot open log file '/var/log/condor/SchedLog'" at line 173 in file /builddir/build/BUILD/condor-23.9.6-build/condor-23.9.6/src/condor_utils/dprintf_setup.cpp

In the master log file (in /var/log/condor/MasterLog) the daemon was quitting because error: couldn't open file "/var/log/condor/ProcLog" for logging: Permission denied and same for SchedLog. I fixed this by manually creating these log files and ensuring proper permissions. I also had to create/chown /var/lock/condor, /etc/condor, /var/lib/condor and /var/run/condor which seems like a horrible idea and something that should be done on install, but now the daemon starts correctly but the collector host fails to connect:

# From /var/log/condor/MasterLog: 
04/22/26 14:29:14 Daemons::StartAllDaemons all daemons were started
04/22/26 14:29:16 Setting ready state 'Ready' for STARTD
04/22/26 14:29:19 attempt to connect to <128.105.68.172:9618> failed: Invalid argument (connect errno = 22).  Will keep trying for 20 total seconds (20 to go).
04/22/26 14:29:39 attempt to connect to <128.105.68.172:9618> failed: Invalid argument (connect errno = 22).
04/22/26 14:29:39 ERROR: SECMAN:2003:TCP connection to collector cm.chtc.wisc.edu failed.
04/22/26 14:29:39 Failed to start non-blocking update to <128.105.68.172:9618>.

I've tried both with/without the university VPN (which also seems to be a requirement, since cm.chtc.wisc.edu doesn't resolve without it), but still cannot connect. Is this correct, or did I misconfigure something? Any help would be greatly appreciated.

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Fields

No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions