Wildfire prediction using machine learning

Σταφυλάς, Δημήτριος

dc.contributor.advisor	Leligou, Helen C. (Nelly)
dc.contributor.author	Σταφυλάς, Δημήτριος
dc.date.accessioned	2022-08-01T09:07:21Z
dc.date.available	2022-08-01T09:07:21Z
dc.date.issued	2022-07-18
dc.identifier.uri	https://polynoe.lib.uniwa.gr/xmlui/handle/11400/2738
dc.identifier.uri	http://dx.doi.org/10.26265/polynoe-2578
dc.description.abstract	Supervised Machine Learning algorithms are widely used in fire science. The objective of this postgraduate thesis was to conduct three experiments using only weather variables for the Attica basin. The first experiment predicted the probability of fire occurrence (binary classification) using 12, 4 and 2 weather variables, respectively. The second predicted the fire scale (multi-class classification: small fire, medium fire, large fire, wildfire) using 12 weather variables. The third predicted the size of the burned area of forest fires (regression task) using 12 and 4 weather variables. First, a new dataset named “wildfire” was compiled, containing the weather conditions prevailing when forest fires occurred in the Attica basin. The three experiments were conducted on this dataset, and the resulting predictions proved particularly impressive. The wildfire dataset was then compared with the well-known Montesinho dataset to evaluate which of the two performed better under supervised Machine Learning algorithms. With all 12 weather variables of the wildfire dataset, a tuned Random Forest model (70%) outperformed the other classification models in fire occurrence prediction accuracy. With the best 4 selected weather features, the Extreme Gradient Boosting (XGBoost) model achieved the best fire occurrence prediction accuracy (67.4%); with the best 2, Neural Networks performed marginally better (63.6%) than Random Forest (63.3%).
For the multi-class fire scale prediction problem (small fire, medium fire, large fire, wildfire), the K-nearest neighbors model performed better (50%) than the other prediction models. For predicting the size of the burned area of forest fires, K-nearest neighbors (r² score of 70%) outperformed the other regression models when all weather variables were used, whereas with 4 selected weather features the regression models performed poorly, with Linear Regression doing best (r² score of 2%). Finally, the wildfire dataset was compared with the Montesinho dataset using 4 and 2 selected weather variables for the first experiment and 4 weather variables for the third experiment. The results showed that the newly created wildfire dataset performed much better when the supervised Machine Learning algorithms were applied.	el
dc.format.extent	68	el
dc.language.iso	en	el
dc.publisher	University of West Attica	el
dc.rights	Attribution-NonCommercial-ShareAlike 4.0 International	*
dc.rights	Attribution 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Machine learning	el
dc.subject	Wildfire	el
dc.subject	Random forest	el
dc.subject	Support Vector Machines	el
dc.subject	Logistic regression	el
dc.subject	Linear regression	el
dc.subject	Neural networks	el
dc.subject	Decision trees	el
dc.subject	Extreme gradient boosting	el
dc.subject	K-nearest neighbors	el
dc.subject	Fire occurrence	el
dc.subject	Fire scale	el
dc.subject	Burned area	el
dc.subject	Fires	el
dc.subject	Machine learning algorithms	el
dc.subject	Forest fires	el
dc.title	Wildfire prediction using machine learning	el
dc.title.alternative	Πρόβλεψη πυρκαγιάς με χρήση μηχανικής μάθησης	el
dc.type	Master's thesis	el
dc.contributor.committee	Papadopoulos, Perikles
dc.contributor.committee	Παπουτσιδάκης, Μιχαήλ
dc.contributor.faculty	School of Engineering	el
dc.contributor.department	Department of Electrical and Electronics Engineering	el
dc.contributor.department	Department of Industrial Design and Production Engineering	el
dc.contributor.master	Artificial Intelligence and Deep Learning	el
dc.description.abstracttranslated	The use of supervised Machine Learning algorithms is widespread in fire science. The aim of this postgraduate thesis was to conduct three experiments using only weather variables for the Attica basin. Specifically, the first experiment predicted the probability of fire occurrence (binary classification) using 12, 4 and 2 weather variables, respectively; the second predicted the fire scale (multi-class classification: small fire, medium fire, large fire, wildfire) using 12 weather variables; and the third predicted the size of the burned area of forest fires (regression task) using 12 and 4 weather variables. First, a new dataset named “wildfire” was compiled, containing the weather conditions prevailing when forest fires occurred in the Attica basin. The three experiments were conducted on this dataset, and the resulting predictions proved particularly impressive. The wildfire dataset was compared with the well-known Montesinho dataset to evaluate which of the two performed better under supervised Machine Learning algorithms. With all 12 weather variables of the wildfire dataset, a tuned Random Forest model (70%) outperformed the other classification models in fire occurrence prediction accuracy. With the best 4 selected weather features, the Extreme Gradient Boosting (XGBoost) model achieved the best fire occurrence prediction accuracy (67.4%); with the best 2, Neural Networks performed marginally better (63.6%) than Random Forest (63.3%).
For the multi-class fire scale prediction problem (small fire, medium fire, large fire, wildfire), the K-nearest neighbors model performed better (50%) than the other prediction models. For predicting the size of the burned area of forest fires, K-nearest neighbors (r² score of 70%) outperformed the other regression models when all weather variables were used, whereas with 4 selected weather features the regression models gave poor results, with Linear Regression doing best (r² score of 2%). Finally, a comparison was made with the Montesinho dataset using 4 and 2 selected weather variables for the first experiment and 4 weather variables for the third experiment. The results showed that the new wildfire dataset performed much better when the supervised Machine Learning algorithms were applied.	el

Files in this item

Name: thesis_june_16_d Α μπ.pdf
Size: 3.118 MB
Format: PDF
Description: Main article

Name: Copyright.pdf
Size: 207.8 KB
Format: PDF
Description: Copyright statement

This item appears in the following Collection(s)

Master's theses - Artificial Intelligence and Deep Learning
Master's theses of the postgraduate programme Artificial Intelligence and Deep Learning

Except where otherwise noted, this item's license is described as
Attribution-NonCommercial-ShareAlike 4.0 International