Author |
Message |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Please describe step by step how the job is being set up and pointed to the agent, and where and how you start it. I am confused by your inconsistent references to the agent and by the missing "user authenticated" message in the agent log file.
|
|
Tue Nov 06, 2007 1:31 pm |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
Step 1: I created the agent (IP .163) on the master (IP .155) and ran a test; it connects when the test is run. Then I went to Add Job on the master (.155), selected "run a program or batch file", and entered ls as the program on the command line (I was originally using a Perl .pl file), with /home/batch/scripts as the start-in directory. Keep in mind I have tried a bunch of command-line and start-in variations. I hit Next and left the default timeout options. The next page is probably what I am doing wrong. It says "apply credentials", so I check the box and enter some that I know work and that also happen to match the account I am signed in with. Then for the host I select the agent (.155), and I finish, pretty much just using the defaults. So I'm creating the job on the master but using the option to run it on the agent. The agent has not had a thing done to it other than being installed and having the program running. After the job is set up I hit Run Now and get the error message "exit code -1".
|
|
Tue Nov 06, 2007 2:07 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Sounds all right to me. Let's try one simple thing to ensure all settings have been saved, especially the remote agent settings. Enable the trace option on the master (Tools/Options; Log page). Close the master and restart it. Clear the contents of the debug.log file on the master. Run the job. Post the relevant content of the debug.log file from the master.
|
|
Tue Nov 06, 2007 2:14 pm |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
Here you go
2007-11-06 13:20:54,205 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - main(...) : start
2007-11-06 13:20:54,205 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : start
2007-11-06 13:20:54,394 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : creating business objects
2007-11-06 13:20:54,657 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : creating UI controller
2007-11-06 13:20:54,776 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : initializing business objects
2007-11-06 13:20:54,796 [main] DEBUG com.softtreetech.jscheduler.business.preferences.AbstractPrefDatabase - Creating backup for preferences file preferences.xml
2007-11-06 13:20:54,798 [main] DEBUG com.softtreetech.jscheduler.business.preferences.AbstractPrefDatabase - Preferences file has been copied to preferences.bak
2007-11-06 13:20:55,092 [main] DEBUG com.softtreetech.jscheduler.business.jobtemplates.JobTemplateSetImpl - loadDescription: path: Template/test_template.ini
2007-11-06 13:20:55,700 [main] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - buildIndex(): totalRecordsNumber: 9
2007-11-06 13:20:55,700 [main] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - buildIndex(): deletedRecordsNumber: 1
2007-11-06 13:20:58,740 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : end
2007-11-06 13:20:58,740 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - main(...) : end
2007-11-06 13:21:29,388 [AWT-EventQueue-0] DEBUG com.softtreetech.jscheduler.db.JobDbImpl - updateJob(JobProperties) : start
2007-11-06 13:21:29,391 [AWT-EventQueue-0] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - update(...) : start
2007-11-06 13:21:29,391 [AWT-EventQueue-0] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - update(...) : end
2007-11-06 13:21:29,393 [AWT-EventQueue-0] DEBUG com.softtreetech.jscheduler.db.JobDbImpl - updateJob(JobProperties) : end
2007-11-06 13:21:32,701 [AWT-EventQueue-0] DEBUG com.softtreetech.jscheduler.business.queue.JobQueue - QUEUE_UNLOCKED
2007-11-06 13:21:32,995 [Job #11 - Unzip PCL] ERROR com.softtreetech.jscheduler.business.runner.RemoteJobRunner - runJob
java.rmi.UnmarshalException: Error unmarshaling return header; nested exception is:
    java.net.SocketException: Connection reset
    at sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:209)
    at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:142)
    at com.softtreetech.jscheduler.business.agent.remote.RemoteAgentImpl_Stub.executeJob(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.RemoteJobRunner.runJob(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.AbstractJobRunner.do(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.AbstractJobRunner.Ó00000(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.AbstractJobRunner.execute(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.JobExecutorImpl.execute(Unknown Source)
    at com.softtreetech.jscheduler.business.runner.JobExecutorImpl$1.run(Unknown Source)
    at java.lang.Thread.run(Thread.java:619)
Caused by: java.net.SocketException: Connection reset
    at java.net.SocketInputStream.read(SocketInputStream.java:168)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:237)
    at java.io.DataInputStream.readByte(DataInputStream.java:248)
    at sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:195)
    ... 9 more
2007-11-06 13:21:32,999 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.business.A.O0oO - disableJob(...) : Job with id [11] is removed.
2007-11-06 13:21:33,005 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.db.JobDbImpl - updateJob(JobProperties) : start
2007-11-06 13:21:33,007 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - update(...) : start
2007-11-06 13:21:33,007 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.db.drivers.file.FileJobDbStorage - update(...) : end
2007-11-06 13:21:33,007 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.db.JobDbImpl - updateJob(JobProperties) : end
2007-11-06 13:21:33,026 [Job #11 - Unzip PCL] DEBUG com.softtreetech.jscheduler.business.queue.JobQueue - QUEUE_UNLOCKED
2007-11-06 13:21:33,154 [Job #11 - Unzip PCL] ERROR com.softtreetech.jscheduler.business.runner.JobExecutorImpl - Job errors: Remote job failed. Exit code: -1
|
|
Tue Nov 06, 2007 2:24 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
The important line there is "Caused by: java.net.SocketException: Connection reset". So it is a connection or authentication issue. Do you have a firewall or some other network monitoring software blocking connections on port 1096?
|
|
Tue Nov 06, 2007 2:46 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Just in case, try using telnet on port 1096 to "ping" the agent computer. I wonder if you get an "unreachable" or "connection refused" error.
Also, for the connection you do specify a valid Linux user and password available on the agent computer, right?
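If telnet is not installed on the master, roughly the same check can be sketched in bash. This is only an illustration: the function name check_port is made up, it relies on bash's built-in /dev/tcp redirection, and it assumes the timeout utility from GNU coreutils is available.

```shell
# Report whether a TCP port accepts connections.
# Usage: check_port HOST PORT   (prints "open" or "closed")
check_port() {
  # bash opens a TCP connection when redirecting to /dev/tcp/HOST/PORT;
  # timeout keeps the check from hanging on a filtered port.
  if timeout 5 bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$1" "$2" 2>/dev/null; then
    echo "open"
  else
    echo "closed"
  fi
}
```

For example, check_port agent-host 1096 (with agent-host replaced by the agent's real address) should print "open" if something is listening on the default agent port. Like telnet, this only shows that the port is reachable; it says nothing about authentication.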
|
|
Tue Nov 06, 2007 2:49 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
By any chance, you didn't change the default port 1096 in the remote agent properties on the scheduler side to 1097 or something else, right?
|
|
Tue Nov 06, 2007 3:16 pm |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
No, the telnet test works; we tried it. I assumed that it would work since I can select the agent I set up and run a test connection. It prompts me for the password, I enter it, and it says the connection to the remote host succeeded. I will try some other things.
|
|
Tue Nov 06, 2007 3:42 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Any new information?
Are you sure that not even a single line appears in the debug.log on the agent during test connects or job runs? Did you restart the agent after changing the preferences.xml file? Is the value still "true" for the "traceEnabled" option?
|
|
Tue Nov 06, 2007 4:37 pm |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
I've restarted just about everything, did a netstat -PO -p 1000-2000, and see the port open and listening. I made sure no firewall rules are blocking anything. The only thing in the debug.log on the agent is what looks to me like the agent starting. I went into the preferences XML file on the agent and changed FTP to PAM and back again. I'm stumped. I am going to see if I can get the master to run a job on an agent that is on a Windows box, just to rule out the master as the issue.
2007-11-06 13:14:18,266 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - main(...) : start
2007-11-06 13:14:18,268 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : start
2007-11-06 13:14:18,276 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : creating business objects
2007-11-06 13:14:18,461 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : creating UI controller
2007-11-06 13:14:18,600 [main] DEBUG com.softtreetech.jscheduler.JSchedulerStarter - startup() : initializing business objects
2007-11-06 13:14:18,674 [main] DEBUG com.softtreetech.jscheduler.business.preferences.AbstractPrefDatabase - Creating backup for preferences file preferences.xml
2007-11-06 13:14:18,679 [main] DEBUG com.softtreetech.jscheduler.business.preferences.AbstractPrefDatabase - Preferences file has been copied
|
|
Tue Nov 06, 2007 5:14 pm |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Please check which JDK/JRE you are using for the scheduler and for the agent. Make sure you are using standard version 1.4 for both and not something else, not a crippled repackaged JRE version that comes with some Linux distros.
To verify the version, run:
$JAVA_HOME/bin/java -version
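To compare the reported version against 1.4 from a script, a minimal sketch follows. The helper name java_series is made up for illustration, and it assumes the first line of output has the usual form java version "1.4.2_19".

```shell
# Extract the release series (for example "1.4") from a version banner line.
# Usage: java_series 'java version "1.4.2_19"'
java_series() {
  # Keep only the first two numeric fields that follow: version "
  printf '%s\n' "$1" | sed -n 's/.*version "\([0-9]*\.[0-9]*\).*/\1/p'
}
```

Note that java -version writes its banner to standard error, so a real check would capture it with something like java_series "$("$JAVA_HOME/bin/java" -version 2>&1 | head -n 1)" and compare the result with 1.4 on both the master and the agent.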
|
|
Tue Nov 06, 2007 5:28 pm |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
jdk1.6.0_03, so we are using the latest version. Hopefully I can figure this out today; I would really like to replace the job scheduler we are using now. I will let you know about my progress. If you have any other suggestions, please let me know.
|
|
Wed Nov 07, 2007 9:58 am |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Ok, here is the catch: "using the latest version" is the root cause of the problem.
JDK versions are not backward compatible! As I mentioned earlier, you must be using JDK or JRE 1.4. Please fix this problem. You don't have to replace the existing JDK; different applications can use different versions. Just install 1.4 into a separate directory, if you don't have it already, and set up $JAVA_HOME to point to the JDK/JRE 1.4 (you can add an export command at the beginning of the 24x7 .sh files).
You must do it on both sides, agent and master scheduler.
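As a sketch of that export, assuming JRE 1.4 was unpacked to /opt/java/j2re1.4.2 (a hypothetical path; substitute your real install directory), the top of each 24x7 .sh file could begin with:

```shell
# Hypothetical location of the separate 1.4 install; adjust as needed.
JAVA_HOME=/opt/java/j2re1.4.2
export JAVA_HOME

# Put the 1.4 binaries first so a bare "java" also resolves to 1.4.
PATH="$JAVA_HOME/bin:$PATH"
export PATH
```

Because the variables are set inside the script, they only affect the scheduler or agent process started by it; other applications on the machine keep using whatever Java version they already use.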
|
|
Wed Nov 07, 2007 11:00 am |
|
 |
wmweaver2
Joined: 29 Oct 2007 Posts: 111 Country: United States |
|
|
|
Ok, the install instructions say v1.4.x or better. I assumed "1.4 or better" meant anything newer than 1.4. If this fixes it I won't care and will just be happy to move on.
|
|
Wed Nov 07, 2007 11:46 am |
|
 |
SysOp
Site Admin
Joined: 26 Nov 2006 Posts: 7969
|
|
|
|
Sorry to say that, but the install instructions were written before the 1.5 and 1.6 releases, and it was assumed that future Java versions would be backward compatible. Unfortunately this is not the case; Sun hasn't done a good job in terms of compatibility, and we haven't had a chance to fix these old instructions.
|
|
Wed Nov 07, 2007 12:11 pm |
|
 |
|