Β
Β
Β
-
Design a pricing model that accommodates a diverse range of audience needs,
- Addressing the limitations of the current pricing system, which overlooks varying interests and preferences
- Proposing new seat grouping standards based on audience profiles and concert characteristics
- Enhancing accessibility and enjoyment for all concertgoers
-
Ensure transparency in pricing decisions through data-driven justification,
- Providing logical, evidence-based pricing grounds
- Making the pricing strategy more open and trustworthy
- Empowering audiences with clear and reliable pricing information
- Data preprocessing & EDA
- Audience Seat Preference Patterns by Genre
- Demand Prediction Modeling
- Price Modeling
We hypothesized that seat preferences would vary by genre, so we split the data by genre and performed K-Means clustering on seat-level data within each group.
To determine the optimal number of clusters, we analyzed the Elbow Curve along with the average and distribution of silhouette coefficients.
- Left (Symphony): Preference for Block D over Block B (left side), possibly due to the orchestra layout where soloists face the conductorβs left
- Middle (Classical): Slight preference for central seating compared to symphony; extreme side seats may distort balance between instruments
- Right (Choral): Strong preference for broad central zones, offering balanced audio from diverse vocal sections
We performed two separate regression tasks:
- R: General reservation rate
- M: Membership conversion rate
- Performance start time
- Pre-sale status
- Genre
- International artist (yes/no)
- Running time
- Day of week
- Month
- Data split: 80% train / 20% test
- Cross-validation: 5-fold CV during GridSearch
- Evaluation metrics: MSE, MLSE
CatBoost showed the lowest MSE, so it was selected as the final model for predicting general reservation rate (R).
Although MLP achieved the lowest MSE for membership conversion rate (M), CatBoost was chosen for its better interpretability.
Comparison of MSE and MLSE scores across 6 regression models.
SHAP value analysis was conducted after model training to extract feature-level insights.
K-Means clustering on performance features resulted in 6 clusters, validated through silhouette score analysis.
The base prices for the highest and lowest seat grades were set using the average of the highest and lowest prices within each assigned cluster.
-
Automated seat grouping and price estimation for new performances based on input characteristics
-
Enhanced seat selection interface that displays additional insights, such as how many customers with certain profiles previously reserved a given seat grade
-
Transparent pricing rationale, helping to build audience trust
-
Data-driven guidance to support more informed purchase decisions
- Enables optimal pricing by adjusting prices based on predicted demand, maximizing both attendance and revenue
- Improves customer satisfaction through reasonable pricing and clearer seat categorization, encouraging repeat visits and revitalizing interest in classical concerts
- Provides a clear guideline for selecting appropriate seats based on individual interest in classical performances
- Helps maximize consumer utility within each individual's willingness to pay
- Python (pandas, sklearn, matplotlib)
- Linear Regression, KMeans Clustering
/bigcontest_2023_pitch.pdfβ Full presentation slides
Copyright 2023. Miraclassic de BOAZ All rights reserved.





