8. Questions on Airflow Service Issues
The following FAQs cover Airflow service issues and their solutions.
Which logs do I look up for Airflow cluster startup issues?
Refer to the logs of the Airflow services, which are brought up during cluster startup.
Where can I find Airflow Services logs?
The Airflow services are Scheduler, Webserver, Celery, and RabbitMQ. The service logs are available at the /media/ephemeral0/logs/airflow location inside the cluster node. Since Airflow runs on a single-node machine, the logs are accessible on the same node. These logs are helpful in troubleshooting cluster bringup and scheduling issues.
What is $AIRFLOW_HOME?
$AIRFLOW_HOME is a location that contains all configuration files, DAGs, plugins, and task logs. It is an environment variable set to /usr/lib/airflow for all machine users.
Where can I find Airflow Configuration files?
The configuration file is located at $AIRFLOW_HOME/airflow.cfg.
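As a quick check, the following minimal sketch resolves the configuration file path and lists its section headers. It assumes a POSIX-compatible shell; the helper name airflow_cfg_path is hypothetical, and the fallback to /usr/lib/airflow simply reuses the default value stated above.

```shell
# Resolve the Airflow home directory; fall back to the documented default
# if the variable is not set in the current shell.
airflow_home="${AIRFLOW_HOME:-/usr/lib/airflow}"

# Hypothetical helper: print the path of the main configuration file.
airflow_cfg_path() {
  printf '%s/airflow.cfg\n' "$airflow_home"
}

cfg="$(airflow_cfg_path)"
if [ -r "$cfg" ]; then
  # List section headers (lines starting with "[") to confirm the file is readable.
  grep -E '^\[' "$cfg" || true
else
  echo "Configuration file not found or not readable: $cfg" >&2
fi
```

On a machine without Airflow, the sketch only reports that the file is missing; it does not modify anything.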
Where can I find Airflow DAGs?
The DAG files are available in the $AIRFLOW_HOME/dags folder.
Where can I find Airflow task logs?
The task logs are available in $AIRFLOW_HOME/logs.
Where can I find Airflow plugins?
The plugins are available in $AIRFLOW_HOME/plugins.
How do I restart Airflow Services?
You can start, stop, or restart each Airflow service. Replace <action> with start, stop, or restart in the commands below:
- Run sudo monit <action> scheduler for Airflow Scheduler.
- Run sudo monit <action> webserver for Airflow Webserver.
- Run sudo monit <action> worker for Celery workers. A stop operation gracefully shuts down existing workers. A start operation adds the number of workers specified in the configuration. A restart operation gracefully shuts down existing workers and then adds the number of workers specified in the configuration.
- Run sudo monit <action> rabbitmq for RabbitMQ.
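The commands above can be wrapped in a small helper that rejects mistyped actions or service names before anything reaches monit. This is only a sketch: the function name airflow_service_cmd is hypothetical, and it prints the command rather than running it so that you can review it first.

```shell
# Hypothetical helper: build the monit command for one Airflow service.
# It prints the command instead of executing it.
airflow_service_cmd() {
  action="$1"
  service="$2"

  # Accept only the three actions described above.
  case "$action" in
    start|stop|restart) ;;
    *) echo "unsupported action: $action" >&2; return 1 ;;
  esac

  # Accept only the four service names listed above.
  case "$service" in
    scheduler|webserver|worker|rabbitmq) ;;
    *) echo "unknown service: $service" >&2; return 1 ;;
  esac

  printf 'sudo monit %s %s\n' "$action" "$service"
}

# Example: print the restart command for the scheduler.
airflow_service_cmd restart scheduler
```

Run the printed command on the cluster node once it looks right.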
How do I invoke Airflow CLI commands within the node?
Airflow is installed inside a virtual environment at the location specified in the environment variable AIRFLOW_VIRTUALENV_LOC. First, activate the virtual environment using the following script:
source ${AIRFLOW_HOME}/airflow/qubole_assembly/scripts/virtualenv.sh activate
After you activate the virtual environment, run the Airflow command.
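The two steps can be combined as in the sketch below, which assumes a bash shell on the cluster node. The function name run_airflow is hypothetical, the fallback to /usr/lib/airflow reuses the documented default for AIRFLOW_HOME, and airflow version is used only as a harmless example command.

```shell
# Path of the activation script, as given in the text above.
activate_script="${AIRFLOW_HOME:-/usr/lib/airflow}/airflow/qubole_assembly/scripts/virtualenv.sh"

# Hypothetical helper: activate the virtual environment, then run an
# Airflow CLI command with the arguments passed to this function.
run_airflow() {
  if [ ! -r "$activate_script" ]; then
    echo "activation script not found: $activate_script" >&2
    return 1
  fi
  # Use a subshell so the activation does not leak into the calling shell.
  (
    source "$activate_script" activate && airflow "$@"
  )
}

# Example: print the installed Airflow version.
run_airflow version || echo "could not run the Airflow CLI on this machine" >&2
```

Because the activation happens in a subshell, the calling shell keeps its original environment after the command finishes.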