Fix docs build

Retribution98 · Retribution98 · commit e282fbf8f34f · 2024-02-08T17:14:43.000+01:00
diff --git a/docs/getting_started/using_modin/using_modin_cluster/using_modin_ray_cluster.rst b/docs/getting_started/using_modin/using_modin_cluster/using_modin_ray_cluster.rst
@@ -12,7 +12,7 @@ local development and cluster execution. Users are not required to think about
 how many workers exist or how to distribute and partition their data;
 Modin handles all of this seamlessly and transparently.
 
-.. image:: ../../../examples/tutorial/jupyter/img/modin_cluster.png
+.. image:: ../../examples/tutorial/jupyter/img/modin_cluster.png
    :alt: Modin cluster
    :align: center
    :scale: 90%
@@ -37,7 +37,7 @@ just run the following command:
 Starting and connecting to the cluster
 --------------------------------------
 
-This example starts 1 head node (m5.24xlarge) and 7 worker nodes (m5.24xlarge), 768 total CPUs.
+This example starts 1 head node (m5.24xlarge) and 5 worker nodes (m5.24xlarge), 576 total CPUs.
 You can check the `Amazon EC2 pricing`_ .
 
 You can manually create AWS EC2 instances and configure them or just use the `Ray autoscaler` to 
@@ -76,7 +76,7 @@ Executing on a cluster environment
 Modin lets you instantly speed up your workflows with a large data by scaling pandas
 on a cluster. In this tutorial, we will use a 12.5 GB `big_yellow.csv` file that was
 created by concatenating a 200MB `NYC Taxi dataset`_ file 64 times. Preparing this
-file was provided as part of our `Modin's cluster setup config`_.
+file was provided as part of our `Modin's Ray cluster setup config`_.
 
 If you want use another dataset in your own script, you should provide it to each of
 the cluster nodes in the same path. We recomnend doing this by customizing the
@@ -119,7 +119,7 @@ with improvements in performance as we increase the number of resources Modin ca
 .. _`Ray's autoscaler options`: https://docs.ray.io/en/latest/cluster/vms/references/ray-cluster-configuration.html#cluster-config
 .. _`Ray's cluster docs`: https://docs.ray.io/en/latest/cluster/getting-started.html
 .. _`NYC Taxi dataset`: https://modin-datasets.intel.com/testing/yellow_tripdata_2015-01.csv
-.. _`Modin's cluster setup config`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/modin-cluster.yaml
+.. _`Modin's Ray cluster setup config`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/modin-cluster.yaml
 .. _`Amazon EC2 pricing`: https://aws.amazon.com/ec2/pricing/on-demand/
 .. _`exercise_5.py`: https://github.com/modin-project/modin/blob/master/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.py
 .. _`Ray client`: https://docs.ray.io/en/latest/cluster/running-applications/job-submission/ray-client.html
diff --git a/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.md b/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.md
@@ -10,7 +10,7 @@
 
 **NOTE**: This exercise has extra requirements. Read instructions carefully before attempting. 
 
-**This exercise instructs users on how to start a 700+ core Ray cluster,
+**This exercise instructs users on how to start a 500+ core Ray cluster,
 and it is not shut down until the end of exercise. Read instructions carefully.**
 
 Often in practice we have a need to exceed the capabilities of a single machine.
@@ -40,7 +40,7 @@ aws configure
 
 ## Starting and connecting to the cluster
 
-This example starts 1 head node (m5.24xlarge) and 7 worker nodes (m5.24xlarge), 768 total CPUs.
+This example starts 1 head node (m5.24xlarge) and 5 worker nodes (m5.24xlarge), 576 total CPUs.
 
 Cost of this cluster can be found here: https://aws.amazon.com/ec2/pricing/on-demand/.
 
@@ -102,12 +102,14 @@ some other Python modules that should be available to execute your own script or
 
 ```bash
 # download a file from the cluster to the local computer:
-ray rsync_down cluster.yaml '/path/on/cluster' '/local/path'
+ray rsync_down modin-cluster.yaml '/path/on/cluster' '/local/path'
 # upload a file from the local computer to the cluster:
-ray rsync_up cluster.yaml '/local/path' '/path/on/cluster'
+ray rsync_up modin-cluster.yaml '/local/path' '/path/on/cluster'
 ```
 
-By running the script on clusters of different sizes, we can see how the CSV file reading time decreases as the number of nodes increases.
+Modin performance scales as the number of nodes and cores increases. The following chart shows
+the performance of the read_csv operation with different number of nodes, with improvements in
+performance as we increase the number of resources Modin can use.
 
 ![ClusterPerf](../../../img/modin_cluster_perf.png)
 
diff --git a/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.py b/examples/tutorial/jupyter/execution/pandas_on_ray/cluster/exercise_5.py
@@ -5,7 +5,7 @@
 
 ray.init(address="auto")
 cpu_count = ray.cluster_resources()["CPU"]
-assert cpu_count == 768, f"Expected 768 CPUs, but found {cpu_count}"
+assert cpu_count == 576, f"Expected 576 CPUs, but found {cpu_count}"
 
 file_size = os.path.getsize("big_yellow.csv")