Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cms-connect single job submit script #2881

Open
singh-ramanpreet opened this issue Apr 16, 2021 · 6 comments
Open

cms-connect single job submit script #2881

singh-ramanpreet opened this issue Apr 16, 2021 · 6 comments

Comments

@singh-ramanpreet
Copy link
Contributor

condor jobs are crashing when using submit_cmsconnect_gridpack_generation_singlejob.sh

Below is condor job stderr

... Unable to find GlideinWMS utilities (../../gwms from /var/condor/execute/dir_19927/glide_YH7iCq/execute/dir_34145)
WARNING  glidein_config not defined () in singularity_lib.sh. Some functions like advertise and error_gen will be limited.
+ set -e
+ name=ZZTo4L01j_5f_NLO_FXFX
+ carddir=cards_ZZTo4L01j_5f_NLO_FXFX
+ queue=local
+ jobstep=ALL
++ cat /etc/redhat-release
+ export 'SYSTEM_RELEASE=CentOS Linux release 7.9.2009 (Core)'
+ SYSTEM_RELEASE='CentOS Linux release 7.9.2009 (Core)'
+ '[' -n slc7_amd64_gcc700 ']'
+ scram_arch=slc7_amd64_gcc700
+ '[' -n '' ']'
+ [[ CentOS Linux release 7.9.2009 (Core) == *\r\e\l\e\a\s\e\ \6* ]]
+ [[ CentOS Linux release 7.9.2009 (Core) == *\r\e\l\e\a\s\e\ \7* ]]
+ cmssw_version=CMSSW_10_6_19
+ '[' -z '' ']'
++ pwd
+ PRODHOME=/srv
+ helpers_dir=/srv/genproductions/Utilities
+ helpers_file=/srv/genproductions/Utilities/gridpack_helpers.sh
+ '[' '!' -f /srv/genproductions/Utilities/gridpack_helpers.sh ']'
++ command -v git
+ '[' -x '' ']'
+ helpers_dir=/srv/Utilities
+ helpers_file=/srv/Utilities/gridpack_helpers.sh
+ source /srv/Utilities/gridpack_helpers.sh
gridpack_generation.sh: line 691: /srv/Utilities/gridpack_helpers.sh: No such file or directory

Have anybody used this script this recently?

I can quickly fix by adding Utilities directory in input tar file, I was wondering if this is correct approach?

@agrohsje
Copy link
Collaborator

Can you comment a bit more on what exactly you are doing (commands) and what the error is? Thanks!

@singh-ramanpreet
Copy link
Contributor Author

singh-ramanpreet commented Apr 26, 2021

I ran the following command,

nohup ./submit_cmsconnect_gridpack_generation_singlejob.sh <process > <cards_dir> 8 "16 Gb" 2880 slc7_amd64_gcc700 &

The error seems to be coming from gridpack_generation.sh when it tries to run on condor.

gridpack_generation.sh: line 691: /srv/Utilities/gridpack_helpers.sh: No such file or directory

@agrohsje
Copy link
Collaborator

Yes. Please make a PR to fix. You can see that in the main one it was changed:
https://github.com/cms-sw/genproductions/blob/master/bin/MadGraph5_aMCatNLO/submit_cmsconnect_gridpack_generation.sh#L225

@singh-ramanpreet
Copy link
Contributor Author

@agrohsje
Thanks for hint, I fixed and actually making gridpack with it before I make PR.

But, I end up with another problem (see below), Do you have any idea?
-> job.err.

Fetching data with command:
  wget --no-check-certificate https://ninja.hepforge.org/downloads//ninja-1.1.0.tar.gz
--2021-05-03 07:11:06--  https://ninja.hepforge.org/downloads//ninja-1.1.0.tar.gz
Resolving ninja.hepforge.org (ninja.hepforge.org)... 129.234.186.186
Connecting to ninja.hepforge.org (ninja.hepforge.org)|129.234.186.186|:443... failed: Connection timed out.
Retrying.

--2021-05-03 07:13:15--  (try: 2)  https://ninja.hepforge.org/downloads//ninja-1.1.0.tar.gz
Connecting to ninja.hepforge.org (ninja.hepforge.org)|129.234.186.186|:443... connected.
Unable to establish SSL connection.
Could not download data at 'https://ninja.hepforge.org/downloads//ninja-1.1.0.tar.gz' because of:
coercing to Unicode: need string or buffer, NoneType found

-> job.out.

Now installing ninja. Be patient...
quit
Process output directory ZZTo4L01j_5f_NLO_FXFX not found.  Either process generation failed, or the name of the output did not match the process name ZZTo4L01j_5f_NLO_FXFX provided to the script.
Something went wrong while running madgraph. Exiting now.

@iarspider
Copy link

@singh-ramanpreet hepforge is down today.

@singh-ramanpreet
Copy link
Contributor Author

@singh-ramanpreet hepforge is down today.

Ah, thanks for the info @iarspider. I guess it's just unfortunate timing then.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants