Multistep Sankey Graph from Dataframe

Question

I have a dataframe with the following structure

INDEX	ANO	DISTRITO	CONCELHO	NCCO
0	2013.0	Aveiro	Albergaria-a-Velha	98
1	2013.0	Aveiro	Albergaria-a-velha	1
2	2013.0	Aveiro	Anadia	41

The full dataset can be found here

This data set ranges from 2013 to 2022 (ANO), and includes 18 different districts (DISTRITO), 278 different counties (CONCELHO) and the number of forest fires per CONCELHO (`NCCO´)

I'm able to produce a one step Sankey graph with this code, that I adapted from here

df = pd.read_csv('heatmap_full.csv') #generated by ingestor.py

all_nodes = df.ANO.values.tolist() + df.DISTRITO.values.tolist() 
source_indices = [all_nodes.index(ANO) for ANO in df.ANO]
target_indices = [all_nodes.index(DISTRITO) for DISTRITO in df.DISTRITO]

colors = px.colors.qualitative.D3
node_colors = [np.random.choice(colors) for node in all_nodes]

fig = go.Figure(data=[go.Sankey(
    # Define nodes
    node = dict(
    pad = 20,
    thickness = 20,
    line = dict(color = "black", width = 1.0),
    label =  all_nodes,
    color =  node_colors,
    ),

    # Add links
    link = dict(
      source =  source_indices,
      target =  target_indices,
      value =  df.NCCO,
))])

fig.update_layout(title_text="FOREST FIRES IN PORTUGAL",
                height = 900,
                width=1200,
                font_size=18)
fig.show()

My Problem/Question

I would like to have a step after DISTRITO for CONCELHO appearing in the Sankey graph, but I can't figure it out.

Can I add a new trace to the figure? Do I need to treat my original dataset in another way?

Any help would be much appreciated

Disclosure This is not meant for commercial use.

Multistep Sankey Graph from Dataframe

Answers (1)

Related Questions