Optimizing AWS Costs for big data workloads