[LON-CAPA-admin] Auto enroll timeout

H. K. Ng hkng at fsu.edu
Thu May 4 08:24:06 EDT 2006


Hi Stuart,

Thanks for the info. The separate program does not use any of the 
loncapa modules - it is to test calls to the university database. 
This is to check that the necessary modules and call routines to 
fetch the class list is working properly. That part which fetches the 
class list is the same as that in localenroll.pm.

I checked both lonsql.log and lonsql_errors. In the first log, 
immediately after I click on the Update Roster Now, the following 
lines appears in lonsql.log. Nothing shows up in lonsql_errors - even 
after the system timed out.

Thu May  4 07:59:20 2006 (2453): <font color='red'>CRITICAL: 
---------- Starting ----------</font>
Thu May  4 07:59:41 2006 (2453): Child 2454 died

The above message seems to suggest that the child process never live 
very long. Is this normal? Also, I inserted a few print STDERR in the 
localenroll::fetch_enrollment and nothing got printed out. This 
suggests that the routine was never called - the hang up occurs 
earlier on. The lonsql is running as indicated by

www       2453  0.0  0.1  13800  5848 ?        Ss   07:59   0:00 
/usr/bin/perl /home/httpd/perl/lonsql
www       2456  0.0  0.1  13800  5888 ?        S    07:59   0:00 
/usr/bin/perl /home/httpd/perl/lonsql
www       2458  0.0  0.1  13800  5888 ?        S    07:59   0:00 
/usr/bin/perl /home/httpd/perl/lonsql
www       2460  0.0  0.1  13800  5888 ?        S    07:59   0:00 
/usr/bin/perl /home/httpd/perl/lonsql
www       2773  0.0  0.1  13800  5892 ?        S    07:59   0:00 
/usr/bin/perl /home/httpd/perl/lonsql

Are there any other things I need to check? There doesn't seem to any 
indication that a module is missing.

Thanks,
-hk



At 04:37 PM 5/3/2006, you wrote:
>Hon Kie,
>
>One question is whether the separate program (which works correctly) 
>was run as www, instead of root.  It should be run as root in order 
>to reproduce the conditions underwhich localenroll::fetch_enrollment() is run.
>
>When you use "Update Roster Now" in lonpopulate, the
>lonnet::fetch_enrollment_query() should result in a call to 
>localenroll::fetchenrollment() from lonsql on the library server  - 
>see lonsql::make_new_child().  You should check that lonsql is running.
>Look for messages in lonsql.log and lonsql_errors.
>
>Also if you add any print STDERR statements in lonsql to debug what 
>might be occurring you should find the corresponding output written 
>to /home/httpd/perl/logs/lonsql_errors.  You'll need to restart 
>loncontrol after adding any such debugging statements to 
>localenroll::fetch_enrollment().  Note: the original decision to use 
>lonsql here was motivated by a desire (in pre-loncnew days) to avoid 
>tying up a lonc/lond connection while waiting for classlist data to 
>be retrieved by localenroll::fetch_enrollment().
>If you see a timeout then that means that lonnet::get_query_reply() 
>did not find /home/httpd/perl/tmp/$queryid.end within 200 s, where 
>$queryid is the value returned after submitting the fetchenrollment 
>query in lonnet::fetch_enrollment_query()  In your example, $queryid 
>was loncapa10.fsu.edu_2460_1146674707_2.
>
>Stuart
>
>H. K. Ng wrote:
>
>>I just switched the library server and everything works fine except 
>>for the autoenroll. It timeout as indicated by the message (below) 
>>in lonnet.log. I can fetch the class list by running a separate 
>>program so I know the connection and all the necessary components 
>>have been installed. (The call routine in this program is the same 
>>as that in localenroll.pm) I inserted some lines in sub 
>>fetch_enrollment (localenroll.pm) to print out (to a file) the 
>>calls to the db and so but nothing got printed out. It seems that 
>>the changes are not called. Both loncontrol and httpd were 
>>restarted after making the changes. I must have missed something. 
>>Any suggestions?
>>
>>Thanks,
>>-hk
>>
>>
>>Wed May  3 12:48:27 2006 (2453): fetch_enrollment_query error: 
>>timeout:loncapa10.fsu.edu_2460_1146674707_2 for fsu fsudc for 
>>loncapa10.fsu.edu_2460_1146674707_2 context: updatenow 
>>8y408217acc584425fsul1 maxtries: 1 tries: 1
>>Wed May  3 12:48:27 2006 (2453): lonpopulate:update rosterfailed to 
>>retrieve classlist data for 8y408217acc584425fsul1 at fsu by fsudc @ fsu: error
>>
>>_______________________________________________
>>LON-CAPA-admin mailing list
>>LON-CAPA-admin at mail.lon-capa.org
>>http://mail.lon-capa.org/mailman/listinfo/lon-capa-admin
>>
>
>_______________________________________________
>LON-CAPA-admin mailing list
>LON-CAPA-admin at mail.lon-capa.org
>http://mail.lon-capa.org/mailman/listinfo/lon-capa-admin




More information about the LON-CAPA-admin mailing list