Spaces:

SustainabilityLabIITGN
/

VayuChat

Running

Nipun Claude commited on Aug 23

Commit

8673e7c

1 Parent(s): 7a86a16

Expand system prompt with comprehensive coding best practices

Add 35+ generic coding rules covering:
- Data validation & safety (6 rules)
- Variable & type handling (6 rules)
- Pandas operations (6 rules)
- Matplotlib & plotting (6 rules)
- Error prevention (7 rules)

This provides LLM with broad toolkit for robust code generation across diverse scenarios.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

Files changed (1) hide show

src.py +40 -6

src.py CHANGED Viewed

@@ -311,12 +311,46 @@ SAFETY & ROBUSTNESS RULES:
 - Use proper string formatting for answers with units (μg/m³)
 CRITICAL CODING PRACTICES:
-- Always convert pandas/numpy objects to proper Python types before operations
-- Use descriptive variable names (avoid single letters in complex logic)
-- Ensure all variables are defined before use
-- Convert datetime/period objects to appropriate types (.astype(str), .astype(int))
-- Reference DataFrame properly in all column operations: df['column'] not 'column'
-- Use proper data type checking and conversion for all operations
 TECHNICAL REQUIREMENTS:
 - Save final result in variable called 'answer'

 - Use proper string formatting for answers with units (μg/m³)
 CRITICAL CODING PRACTICES:
+DATA VALIDATION & SAFETY:
+- Always check if DataFrames/Series are empty before operations: if df.empty: return
+- Use .dropna() to handle missing values or .fillna() with appropriate defaults
+- Validate column names exist before accessing: if 'column' in df.columns
+- Check data types before operations: df['col'].dtype, isinstance() checks
+- Handle edge cases: empty results, single row/column DataFrames, all NaN columns
+- Use .copy() when modifying DataFrames to avoid SettingWithCopyWarning
+VARIABLE & TYPE HANDLING:
+- Use descriptive variable names (avoid single letters in complex operations)
+- Ensure all variables are defined before use - initialize with defaults
+- Convert pandas/numpy objects to proper Python types before operations
+- Convert datetime/period objects appropriately: .astype(str), .dt.strftime(), int()
+- Always cast to appropriate types for indexing: int(), str(), list()
+- Use explicit type conversions rather than relying on implicit casting
+PANDAS OPERATIONS:
+- Reference DataFrame properly: df['column'] not 'column' in operations
+- Use .loc/.iloc correctly for indexing - avoid chained indexing
+- Use .reset_index() after groupby operations when needed for clean DataFrames
+- Sort results for consistent output: .sort_values(), .sort_index()
+- Use .round() for numerical results to avoid excessive decimals
+- Chain operations carefully - split complex chains for readability
+MATPLOTLIB & PLOTTING:
+- Always call plt.close() after saving plots to prevent memory leaks
+- Use descriptive titles, axis labels, and legends
+- Handle cases where no data exists for plotting
+- Use proper figure sizing: plt.figure(figsize=(width, height))
+- Convert datetime indices to strings for plotting if needed
+- Use color palettes consistently
+ERROR PREVENTION:
+- Use try-except blocks for operations that might fail
+- Check denominators before division operations
+- Validate array/list lengths before indexing
+- Use .get() method for dictionary access with defaults
+- Handle timezone-aware vs naive datetime objects consistently
+- Use proper string formatting and encoding for text output
 TECHNICAL REQUIREMENTS:
 - Save final result in variable called 'answer'