How to scrape the aspx.net

Question

Python is new to me. I've been attempting to download the table from this website: https://tradereport.moc.go.th/Report/ReportEng.aspx?Report=HarmonizeCommodity&Lang=Eng&ImExType=1&Option=1 However, it appears complicated because the page source code does not have a button which needs to be clicked to show the report.

Thank you very much.

Firstly, before getting to that complicated table structure, I make it simple by testing only how to get the result table by clicking "ReviewReport" button. It shows table on the webpage but nothing I can scrape out of the page source. Please help me how to get the table data.

from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.support.ui import Select
from selenium.common.exceptions import NoSuchElementException
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.common.keys import Keys
import pandas as pd

Year = ('2021')
Month = ('11')
HScode = ('721990')

options = Options()
#options.headless = True
#driver = webdriver.Chrome('C:\Python\Scripts\chromedriver.exe', options=options)
driver = webdriver.Firefox(executable_path=r'C:\Python\geckodriver.exe')
driver.get("https://tradereport.moc.go.th/Report/ReportEng.aspx?      Report=HarmonizeCommodity&Lang=Eng&ImExType=1&Option=1")
time.sleep(5)
driver.get("https://tradereport.moc.go.th/Report/ReportEng.aspx?    Report=HarmonizeCommodity&Lang=Eng&ImExType=1&Option=1") #reenter the url again to get the table page
time.sleep(5)  


driver.find_element_by_id("ddlYear").send_keys(Year)  #Working fine
driver.find_element_by_id("ddlMonth").send_keys(Month)  #Working fine
driver.find_element_by_id("txtHsCode").send_keys(HScode) #Working fine
submitbttn = driver.find_element_by_id("btnSubmit")       #Working fine
submitbttn.click()

time.sleep(5)

f=open("d:\page.txt","w")
f.write(driver.page_source)
f.close()

print ("************* Scrapping data done**********************")
driver.quit`

How to scrape the aspx.net

Answers (1)

Related Questions