Analyze and interpret large datasets hosted on cloud platforms such as AWS, Azure, and Google Cloud.
Use cloud-based databases and data warehouses, such as Snowflake, Google BigQuery, Azure Synapse Analytics, or Amazon Redshift, to perform complex data analyses.
Design and develop BI dashboards and reports using cloud-enabled tools like Tableau, Power BI, and Looker.
Write and optimize complex SQL queries for data extraction and transformation across cloud data systems.
Use Python and libraries like NumPy, Pandas, and Scikit-learn for advanced data preprocessing, statistical analysis, and predictive modeling.
Apply NLP and LLM techniques to structured and unstructured data to extract advanced insights.
Work closely with data engineering teams to ensure proper data quality and governance in cloud systems.
Ensure that analytical queries scale and perform well in a cloud-first environment.
Document all analytical processes and communicate findings effectively in Japanese and English to technical and non-technical stakeholders.
Qualification
Bachelor's degree
Skills Required
Communication Skills: Fluency in Japanese and English, with the ability to present data insights clearly.
Data Warehouses: Proficiency in Snowflake, BigQuery, Azure Synapse Analytics, or Amazon Redshift.
Query Languages: Advanced skills in SQL for querying and optimizing cloud-based databases.
BI Tools: Experience with Tableau, Power BI, or Looker, particularly in a cloud-native environment.
Python Expertise: Hands-on experience with NumPy, Pandas, and statistical modeling frameworks like Scikit-learn.
API Integration: Experience integrating APIs to extract and process data from various cloud systems.
Cloud Platforms: Experience with AWS, Azure, or Google Cloud for data handling and analysis.
Technical Knowledge: Solid understanding of algorithms, mathematical concepts, and system performance optimization.
Documentation: Strong skills in maintaining and documenting processes in a cloud-centric ecosystem.
NLP and LLMs: Familiarity with NLP techniques and LLMs for advanced text data analysis.
Data Governance: Knowledge of data governance and security best practices in the cloud.